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America’s carbon compromise 


As looming tax increases and budget cuts threaten to plunge the US economy back into recession, 
Congress should take a hard look at introducing a carbon tax as animportant part of the solution. 


House knowing that he was poised on the edge of a fiscal cliff. 

Rather than relishing his victory last week, Obama must 
immediately set about crafting a compromise on deficit reduction with 
congressional leaders. The stakes could hardly be higher — for science, 
for US citizens and, indeed, for the world. In the event of failure, a 
budgetary time-bomb of tax increases and sweeping budget cuts will 
detonate on 2 January. As well as resulting in indiscriminate cuts to 
funds for scientific research and many other areas, it could knock the 
United States back into recession and deliver yet another blow to an 
already fragile global economy. 

Faced with such dire consequences, one might expect that all the 
financial options would be on the table, especially the good ones. 
Unfortunately, this is not the case, at least not yet. 

So far, lawmakers have rehashed long-standing disputes about the 
size of government and the social safety net, but have ignored ideas 
that could transform the fiscal challenge into an opportunity. One such 
proposal is the carbon tax, which could bring financial and political 
benefits for all and chart a new course forward for energy independ- 
ence and global warming (see page 309). It is a solution that is every 
bit as improbable as it is logical, but one should remember Winston 
Churchill's assessment of the United States’ tendency to do the right 
thing — once all the alternatives have been exhausted. 

Just consider the possibilities. To put a levy on carbon would raise 
revenues that could be used to offset lower tax rates for individuals and 
businesses. This is what conservatives say they want to do. It would 
put more income — and thus choice — in the hands of consumers. 
Economists like the idea for more fundamental reasons. Generally, it 
is best to tax things that one wishes to discourage (such as smoking) 
rather than those that should be encouraged (such as working). Envi- 
ronmentalists like the idea of a carbon tax because it could generate 
some much-needed revenue for clean-energy research and develop- 
ment while reducing carbon emissions. 

The numbers are not negligible. An analysis conducted in August 
by economists at the Massachusetts Institute of Technology (MIT) 
in Cambridge showed that a carbon tax of US$20 per tonne of 
carbon dioxide from fossil fuels, if instituted in 2013 and increased by 
4% per year, would raise $1.5 trillion over the course of a decade. Aver- 
aged out, this amounts to $150 billion annually — a sizeable chunk 
of the trillion-dollar deficits that the US government has been run- 
ning in recent years. Scholars at the Brookings Institution, a centrist 
think tank in Washington DC, advocate ramping federal investments 
in energy research up from $3.8 billion now to $30 billion annually, to 
drive down the cost of low-carbon energy (including cleaner-burning 
coal). It is an ambitious proposal, and would leave a pile of cash that 
could be redistributed elsewhere for beneficial use. 

Conservatives loathe taxes, and US politicians obsess over energy 
prices, but a revenue-neutral carbon tax would get around these 
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problems. The MIT analysis found that the economy benefited 
regardless of whether the money was reinvested in social programmes 
or redistributed in the form of lower taxes and cash payments to offset 
higher energy costs for the poor. For environmentalists, the problem 
with a carbon tax is that it does not technically limit emissions, but the 
MIT model suggests that it would perform quite well: carbon emissions 
fall to 14% below 2006 levels by 2020 as consumers and businesses find 
ways to reduce their energy use in response to higher prices. 
Opposition to the idea may not be what it 


“A carbon tax was. For example, on 13 November, the Amer- 
would depend ican Enterprise Institute hosted a conference 
on political in Washington DC on the economics of a car- 
courage and a bon tax. The institute is a conservative think 
break withparty _ tank, and its officials have previously raised 


doubts about climate science. The idea has 
also been bubbling up in other right-leaning 
think tanks as a conservative solution to reduce greenhouse gases. 
The problem is that to enact a carbon tax would depend on political 
courage and a willingness to break with party orthodoxy, rare traits in 
Washington today. President Obama has made energy and climate part 
of his agenda for the second term, but his first — and perhaps biggest 
— opportunity to make good on that promise will come in the next few 
weeks. As US politicians contemplate diving into the fiscal abyss, they 
would be wise to consider a painless policy that benefits everyone. m 


orthodoxy.” 


A shaky restart 


Japan still has lessons to learn from Fukushimaif 
it is to convince the public about nuclear energy. 


Japan uncovered serious flaws in the country’s nuclear-safety 

regulations. Japan learned its lesson: it started putting a pre- 
mium on safety, and is doing everything it can to assure a wary public 
that similar mistakes will not be made again. Well, that was the hope. 
Two recent revelations show that it could still do much more. 

The country’s Nuclear Regulation Authority (NRA) was set up to 
right the wrongs of the previous regulatory infrastructure. One of its 
first tasks was to draw up new safety standards for reactor operations. 
The NRA formed an investigation team of six experts, which held its 
first meeting on 25 October. The team is expected to submit its report 
in time for the NRA to put the standards up for public comment in the 
spring and to make them law in July 2013. 

Last week, Japanese media reported that four of those experts have 
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received regular stipends or one-time grants from the nuclear industry. 
Nuclear engineer Akio Yamamoto of Nagoya University, for example, 
has received at least ¥50,000 (US$630) over the past three years from 
each of three companies related to nuclear energy, including Nuclear 
Engineering, a firm in Osaka that is affiliated with Kansai Electric 
Power. Although there is no suggestion that Yamamoto has done any- 
thing wrong, he also received some ¥27 million in grants from eight 
nuclear-energy companies during that period, as well as an undisclosed 
amount from Mitsubishi Heavy Industries, which builds reactors. 

An NRA spokesman defended the team’s composition, arguing that 
the report will be used only as a reference for the five NRA commis- 
sioners who will ultimately decide on the policy. (Those commission- 
ers do not have similar ties with industry.) If the NRA had tried to rule 
out everyone with any connection to the industry when choosing the 
experts, the spokesman said, it would have run out of people. 

These are fair points, and the fact that the team members had to 
disclose their contributions at all was a laudable nod to transparency. 
But playing down the importance of the report by saying it will just be 
used as a reference is unconvincing. 

Much of the uproar over the handling of the Fukushima disaster came 
from the public perception that conflicts of interests led regulators, who 
were too tightly tied to the nuclear industry, to favour cost-savings over 
safety. The NRA, created in large part as an answer to that criticism, has 
itself been lambasted for moving many staff from the old regulatory 
structure to the new organization, including its head, Shunichi Tanaka. 
It seems that Japanese policy-makers, despite the many public dem- 
onstrations, still haven't got to grips with the tendency for conflicts of 


interest to lead to bad decisions and, even if they dont, to breed mistrust. 
Similarly troubling is the rush with which the government reopened 
two of the country’s shuttered nuclear reactors in July without fully 
evaluating the seismology of the ground beneath. Last week, at its sec- 
ond meeting, a subcommittee of the NRA could not confirm whether 
a fault line running under a seawater-intake 


“Japan was channel — used to cool the reactors in an 
supposed to emergency — is active. 

emerge witha At stake is whether the fault is a landslide 
new respect ‘for fault or a more dangerous, deeper tectonic 


one. The NRA has ordered Kansai Electric, 
the plant’s operator, to dig trenches to inves- 
tigate the geology more thoroughly. That should take less than two 
months, but existing facilities at the plant are in the way, making it 
much more complicated — and expensive. 

Even if the risk from that fault is trivial, as many think, critics point 
out that the threat of shaking from nearby faults, the potential size of 
a tsunami and the possibility of structural defects like those found at 
Fukushima have not been adequately characterized. 

Large sectors of the public opposed the reactor restarts with dem- 
onstrations of a fervour not seen in Japan in decades. The country had 
already proved that it could get by, at least in the short term, with no 
nuclear power. Some scientists had pointed out the uncertainty over 
the seismic fault, and suggested how to deal with it, before the reactors 
were restarted. Japan was supposed to emerge from the Fukushima 
crisis with a new respect for reactor safety and better awareness of the 
need to convince people of that safety. It hasn’t made a very good start. = 


reactor safety.” 


Save scientific sites 


The push to conserve cultural-heritage sites 
must not leave out areas of interest to science. 


Samarkand in Uzbekistan, home to the fifteenth-century obser- 

vatory of the astronomer Ulugh Beg. That is in part thanks to the 
United Nations Educational, Scientific and Cultural Organization 
(UNESCO) World Heritage Convention, which this week celebrates its 
40th anniversary (see page 328). Samarkand was put on the World Her- 
itage List in 2001; the listing gives itimportant protection from the ongo- 
ing political chaos that has followed the collapse of the Soviet Union. 

The observatory must have been lovely during the two decades that 
it was active. Contemporary reports describe splendid architecture 
and exquisite tiling and mosaics. Frescos illustrating the orbits of the 
planets and the exact positions of stars adorned the observatory’s inner 
walls. It was largely destroyed by God-fearing hordes in 1449, but the 
innovative work of its scientists survived to influence Western astron- 
omy and algebra. Using a sextant 40 metres in radius, astronomers at 
the observatory recalculated the positions of nearly 1,000 stars and 
compiled their results in the widely translated 1437 star catalogue Zij-i 
Sultani. They stabilized the sextant by anchoring it in a 2-metre-wide 
trench dug into a hill in the plane of the meridian. The measurements 
were of unprecedented accuracy: the astronomers used them to recal- 
culate trigonometric tables and to calculate the length of the sidereal 
year (the time taken for Earth to orbit the Sun once in relation to the 
fixed stars) to within one minute of the measurement now accepted. 
Archaeologists discovered the remains of the observatory in 1908. 

A place on the World Heritage List means that a site must be main- 
tained with international-standard conservation methods, and not 
spoiled with inappropriate building development. International 
inspection teams visit to ensure compliance. Inspections of Samar- 
kand revealed in the mid-2000s that conservation was under par, and 
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that city planners, supported by local politicians, alarmingly failed to 
respect the integrity of the site. UNESCO called for increased monitor- 
ing of the site and threatened to place it on the List of World Heritage in 
Danger — although it has so far avoided this designation. 

In the past few decades, scientists from many disciplines have devel- 
oped techniques — ranging from lasers to nuclear technologies and 
microbiology — for conserving and restoring artworks and monuments. 
Scientists at the Foundation for Research and Technology — Hellas in 
Heraklion, Greece, for example, invented a laser with one beam in the 
ultraviolet range and another in the infrared to clean a frieze on the 
Parthenon, part of a World Heritage Site, without damaging its surface. 
The widely publicized work on the frieze was completed in 2005. 

Understandably, scientists would like more funding to allow them 
to fine-tune such techniques. That is hard to justify generically — each 
archaeological site or monument has its own problems, with technical 
solutions that must be worked out on an individual basis. The story 
of Samarkand shows that politics — and thus the World Heritage List 
— is at least as important as science to the conservation of important 
monuments around the world. Funds for cultural-heritage technol- 
ogies must be maintained as part of a broad approach to consider 
cultural heritage more widely. City and regional plans to cope with 
climate change, for example, should be required to consider the impact 
on cultural heritage. The European Commission is quietly voicing 
support for such a push, and it should be encouraged to speak louder. 

Very few of the 962 entries on the World Heritage List involve sci- 
entific sites, perhaps because science is not automatically thought of as 
a part of culture. However, astronomers have begun to do something 
about this. In the 2009 International Year of Astronomy, astronomers 
worked with a UNESCO advisory group, the International Council 
on Monuments and Sites, to produce a list of astronomical sites that 
they think are, like the Samarkand Observatory, worth saving. These 
include the nineteenth-century Royal Observatory in Cape Town, 
South Africa. If this site is designated as a World 
Heritage Site, interest surrounding it might 
encourage the much-desired development of 
science in the country. Other scientists should 
follow the astronomers’ example. = 
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warming is a worse problem than they thought a few years ago. The 

seriousness of this change is not appreciated by politicians and the 
public. The scientific world carefully measures the speed with which 
we approach the cliff and will, no doubt, carefully measure our rate of 
fall. But it is not doing enough to stop it. 1am a specialist in investment 
bubbles, not climate science. But the effects of climate change can 
only exacerbate the ecological trouble I see reflected in the financial 
markets — soaring commodity prices and impending shortages. 

My firm warned of vastly inflated Japanese equities in 1989 — the 
grandmother of all bubbles — US growth stocks in 2000 and every- 
thing risky in late 2007. The usual mix of investor wishful thinking 
and dangerous and cynical encouragement from 
industrial vested interests made these bubbles pos- 
sible. Prices of global raw materials are now rising 
fast. This does not constitute a bubble, however, 
but is a genuine paradigm shift, perhaps the most 
important economic change since the Industrial 
Revolution. Simply, we are running out. 

The price index of 33 important commodities 
declined by 70% over the 100 years up to 2002 — 
an enormous help to industrialized countries in 
getting rich. Only one commodity, oil, had been 
flat until 1972 and then, with the advent of the 
Organization of the Petroleum Exporting Coun- 
tries, it began to rise. But since 2002, prices of 
almost all the other commodities, plus oil, tripled 
in six years; all without a world war and without 
much comment. Even if prices fell tomorrow by 
20% they would still on average have doubled in 
10 years, the equivalent of a 7% annual rise. 

This price surge is a response to global population growth and the 
explosion of capital spending in China. Especially dangerous to social 
stability and human well-being are food prices and food costs. Growth in 
the productivity of grains has fallen to 1.2% a year, which is exactly equal 
to the global population growth rate. There is now no safety margin. 

Then there is the impending shortage of two fertilizers: phosphorus 
(phosphate) and potassium (potash). These two elements cannot be 
made, cannot be substituted, are necessary to grow all life forms, and are 
mined and depleted. It’s a scary set of statements. Former Soviet states 
and Canada have more than 70% of the potash. Morocco has 85% of 
all high-grade phosphates. It is the most important quasi-monopoly in 
economic history. 

What happens when these fertilizers run out is a question I can’t get 
satisfactorily answered and, believe me, I have 
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Be persuasive. Be brave. 
Be arrested (if necessary 


A resource crisis exacerbated by global warming is looming, argues financier 
Jeremy Grantham. More scientists must speak out. 


fertilizer problem is seen also in the shocking lack of awareness on 
the part of governments and the public of the increasing damage to 
agriculture by climate change; for example, runs of extreme weather 
that have slashed grain harvests in the past few years. Recognition of 
the facts is delayed by the frankly brilliant propaganda and obfuscation 
delivered by energy interests that virtually own the US Congress. (It is 
not unlike the part played by the financial industry when investment 
bubbles start to form ... but that, at least, is only money.) We need oil 
producers to leave 80% of proven reserves untapped to achieve a stable 
climate. As a former oil analyst, I can easily calculate oil companies’ 
enthusiasm to leave 80% of their value in the ground — absolutely nil. 

The damaging effects of climate change are accelerating. James 
Hansen of NASA has screamed warnings for 30 
years. Although at first he was dismissed as a mad- 
man, almost all his early predictions, disturbingly, 
have proved conservative in relation to what has 
actually happened. In 2011, Hansen was arrested 
in Washington DC, alongside Gus Speth, the 
retired dean of Yale University’s environmental 
school; Bill McKibben, one of the earliest and 
most passionate environmentalists to warn about 
global warming; and my daughter-in-law, all for 
protesting over a pipeline planned to carry Cana- 
dian bitumen to refineries in the United States, 
bitumen so thick it needs masses of water even to 
move it. From his seat in jail, Speth said that he had 
held some important positions in Washington, but 
none more important than this one. 

President Barack Obama missed the chance 
of a lifetime to get a climate bill passed, and his 
great environmental and energy scientists John 
Holdren and Steven Chu went missing in action. Scientists are under- 
standably protective of the dignity of science and are horrified by pub- 
licity and overstatement. These fears, unfortunately, are not shared by 
their opponents, which makes for a rather painful one-sided battle. 
Overstatement may generally be dangerous in science (it certainly is 
for careers) but for climate change, uniquely, understatement is even 
riskier and therefore, arguably, unethical. 

It is crucial that scientists take more career risks and sound a more 
realistic, more desperate, note on the global-warming problem. 
Younger scientists are obsessed by thoughts of tenure, so it is prob- 
ably up to older, senior and retired scientists to do the heavy lifting. 
Be arrested if necessary. This is not only the crisis of your lives — it 
is also the crisis of our species’ existence. I implore you to be brave. = 


Jeremy Grantham is co-founder and chief investment strategist at 
GMO, and co-chair of the Grantham Foundation for the Protection of 
the Environment, in Boston, Massachusetts. 

e-mail: jeremy@granthamfoundation.org 
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scientific literature 


RESEARCH HIGHLIGHTS 


PHYSIOLOGY 


How cold triggers 
fat formation 


In frigid temperatures, mice 
ramp up the production of 
heat-generating brown fat by 
decreasing the levels of a small 
RNA molecule. 

Because brown fat burns 
energy — unlike its unpopular 
cousin, white fat — its 
production is an attractive 
target for obesity and diabetes 
therapies. Markus Stoffel at 
the Swiss Federal Institute of 
Technology in Zurich and his 
colleagues found that exposure 
to cold reduced the expression 
of microRNA-133 in brown 
and subcutaneous white 
fat. Inhibiting miRNA- 133 
promoted brown-fat formation, 
whereas forcing miRNA-133 
expression switched off brown- 
fat production. 

The small RNA acted by 
directly inhibiting PRDM16, 

a protein that is central to the 
production of brown fat from 
white-fat-cell precursors. 
Nature Cell Biol. http://dx.doi. 
org/10.1038/ncb2612 (2012) 


Thinning ice more 
fragile and mobile 


The pronounced thinning 
of Arctic sea ice has made 
the ice pack more brittle and 
susceptible to wind drift. 

The volume of Arctic sea ice 
decreased by one-third during 
2007-11 compared with the 
1979-2006 mean. In a model 
simulation, Jinlun Zhang at 
the University of Washington 
in Seattle and his colleagues 
demonstrate that the decline in 
volume substantially reduces 
the mechanical strength of 
the ice, thus boosting ice-drift 
speed and deformation rates. 

Forecasts of ice-edge 
locations will become more 
challenging as the thinning 


Thick-skinned but sensitive 


Crocodiles and alligators may sense their prey 
using tiny bumps on their mouths that are 


highly sensitive to touch. 


Kenneth Catania and Duncan Leitch at 
Vanderbilt University in Nashville, Tennessee, 
investigated the raised bumps — called 
integumentary sensory organs — in 18 American 
alligators (Alligator mississippiensis; hatchling 
pictured) and 4 Nile crocodiles (Crocodylus 
niloticus). Confocal microscopy revealed that 
the bumps (pictured yellow), which are packed 


and weakening of sea ice leads 
to a state of free drift, the 
authors note. 

Geophys. Res. Lett. http://dx.doi. 
org/10.1029/2012GL053545 
(2012) 


| NEUROSCIENCE 
Blind reading 
with sounds 


Blind adults taught to ‘read’ 
using sounds that represent 
letters use the same area of the 
brain’s visual cortex that sighted. 
humans use when reading. 
Using a program that 
‘describes’ images in sound, 
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most tightly around the teeth and mouth, share 
similar structures with tactile skin receptors in 


mammals. The bumps seemed to be insensitive 


Amir Amedi at the Hebrew 
University of Jerusalem in 
Israel and his team trained 
eight congenitally blind people 
to decipher the shapes of letters 
and objects such as faces and 
tools. The authors then imaged 
participants’ brains as they 
listened to sounds associated 
with letters or other objects. 
When the volunteers read using 
the sounds, they activated the 
same part of the visual cortex 
— the visual word form area 
(VWFA) — as sighted controls 
did when viewing letters. 

The work shows that the 
VWEA is not dependent on 
visual information alone, 


to electrical current or water salinity, but showed 
nerve responses when stimulated with a range of 
levels of force — responding to low levels witha 
sensitivity exceeding that in primate fingertips. 
The authors suggest that the animals use the 
sensitive bumps to locate prey, and to identify 
food and other items inside their mouths. 
J. Exp. Biol. 215, 4217-4230 (2012) 


and suggests that the visual 
cortex may be trained to help 
recognize the environment, 
even in those who are blind 
from birth. 

Neuron 76, 640-652 (2012) 


MATERIALS 


Counting loops 
in gels 


Polymer networks, such as 
rubber and nylon, consist of 
linked chain-like or branched 
molecules that almost always 
contain loops — structural 
imperfections that weaken a 
network’s connectivity and 
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lower the material's elasticity. 

Jeremiah Johnson 
and his colleagues at the 
Massachusetts Institute of 
Technology in Cambridge 
have developed a method to 
count the number of the most 
common loops in polymeric 
materials. The authors broke 
a hydrogel, a type of polymer 
network that soaks up water, 
into quantifiable fragments 
that reflected the connectivity 
of the original network, then 
used mass spectrometry to 
count the loops. They found 
that too many loops prevented 
the gel from forming. 

The researchers say they 
are now using their method to 
correlate the effects of loops on 
the mechanical properties of a 
variety of polymer networks. 
Proc. Natl Acad. Sci. USA 
http://dx.doi.org/10.1073/ 
pnas.1213169109 (2012) 
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Wrens learn as 
embryos in the egg 


A single song element is all that 
superb fairy-wren nestlings 
need to include in their 
begging calls to get fed by their 
mothers, and, in an unusual 
example of prenatal learning, 
the nestlings seem to learn this 
‘password as embryos. 

Adult superb fairy-wrens 
(Malurus cyaneus; pictured) 
use these begging calls to 
distinguish their offspring 
from those of two cuckoo 
species that often invade 
their nests. Sonia Kleindorfer 
at Flinders University in 
Adelaide, Australia, and her 
team analysed recordings of 
the fairy-wren calls and found 
that each nest had a common 
begging call different from 

those ofall other 


nests. That call contained a 
signature element also found in 
the call the mother made while 
incubating her eggs. When the 
team swapped eggs around 
across 22 nests, nestlings from 
those eggs begged using the 
calls of their foster, not their 
biological, mothers, suggesting 
that the calls were learned. 

Curr. Biol. http://dx.doi. 
org/10.1016/j.cub.2012.09.025 
(2012) 


Brain-machine 
does the two-step 


Brain-machine interfaces 
(BMIs) detect and use brain 
activity to perform an intended 
task, and could be invaluable to 
people with paralysis. Typically, 
BMIs are able to process only 
single movements, but one 
developed by Ziv Williams at 
Harvard Medical School in 
Boston, Massachusetts, and 
his colleagues can control a 
series of motions — potentially 
expanding the complexity of 
tasks that BMIs can perform. 
The team recorded brain 
activity in monkeys that were 
trained to move a computer 
cursor with their paws to each 
of two areas ona screen ina 
particular order. This revealed 
activity in two distinct groups 
of neurons in the brain’s 
premotor cortex that was 
associated with each of the 
upcoming movements. The 
authors then programmed a 
computer to decode this signal 
from the brain and found that 
the mind-controlled computer 
moved the cursor at about the 
same speed that the monkeys 
achieved with their paws. 
Nature Neurosci. http://dx.doi. 
org/10.1038/nn.3250 (2012) 


When neurons 
mature too early 


A genetic mutation linked 

to intellectual disability and 
autism causes the premature 
formation of functional 
connections between brain 
cells during a crucial window 
of development early in life. 


COMMUNITY 


CHOICE 


Mega-MOF’s super surface 


Metal-organic frameworks, or MOFs, are 

of interest for applications such as catalysis 
and gas storage. Researchers now report 

a method that allowed them to synthesize 

these porous crystals with record-breaking surface areas. 

Omar Farha at Northwestern University in Evanston, Illinois, 
and his colleagues created two copper-based MOFs, each with a 
surface area of approximately 7,000 square metres per gram. To 
help boost surface area, they used supercritical carbon dioxide 
to activate the MOFs, avoiding framework collapse, which can 
occur when the solvents used in MOF synthesis are removed. 

Moreover, the authors calculated that by using acetylenes, 
rather than more bulky phenyls, as links in their framework, 
they could further increase the theoretical maximum surface 
area of MOFs to as high as 14,600 square metres per gram, 
roughly 40% higher than some previous estimates, the team 
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suggests. 


J. Am. Chem. Soc. 134, 15016-15021 (2012) 


Mutations that inactivate 
one copy of the gene SYNGAP1 
often cause intellectual 
disability in humans. Gavin 
Rumbaugh of the Scripps 
Research Institute in Jupiter, 
Florida, and his team found 
that mice with a similar 
mutation produce neurons 
that mature too quickly after 
birth and become overactive 
ina brain region important for 
cognitive function. Mice with 
one copy of SYNGAPI have 
memory problems and are 
prone to seizures — a symptom 
in humans with the mutations. 

Correcting the mutation in 
mice after this developmental 
period had little effect on the 
symptoms, and introducing the 
mutation into adult mice did 
not affect neuronal function — 
suggesting that the activity of 
the SYNGAP1 protein during 
this developmental window has 
long-lasting effects. 

Cell 151, 1-15 (2012) 


More co-orbiters 
for Neptune 


Some astronomers think that 
Neptune (pictured) can no 
longer capture objects whose 
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orbits around the Sun are 


similar to its own. But Carlos 
and Ratl de la Fuente Marcos 
at the Complutense University 
of Madrid in Spain, have 
discovered that four objects 
originally classified as minor 
planets are actually co-orbiters 
that joined Neptune's orbit as 
recently as 50,000 years ago. 
The work brings to 14 the 
number of objects that, like 
Neptune, orbit the Sun every 
165 years. The four latest 
objects are not in the plane of 
the Solar System and follow 
eccentric paths. One is likely to 
diverge from its current path 
just 2,000 years from now. 
Astron. Astrophys. 547,L2 (2012) 


© NATURE.COM 

For the latest research published by 
Nature visit: 
www.nature,com/latestresearch 


NASA/SPL 


SEVEN DAYS 


| RESEARCH 
Stem-cell faker 


A scientist who last month 
fabricated a story about using 
induced pluripotent stem cells 
to treat patients with heart 
failure (see Nature 491, 7-8; 
2012) has retracted two of his 
papers. Hisashi Moriguchi, 

of the University of Tokyo, 
retracted research in Scientific 
Reports. He also withdrew 

his claim to be affiliated 

with Massachusetts General 
Hospital and Harvard Medical 
School, and to have received 
approval for his work from an 
institutional review board. 


Expensive errors 
Mistakes in accounting for 
research projects funded by 
the European Union in 2011 
may have amounted to 3% 
of the total spending in this 
area — around €360 million 
(US$460 million), according 
to a report published on 

6 November. The European 
Court of Auditors found 
that “over-declaration of 
costs by beneficiaries” was 
the most common mistake 
by claimants, but said that 
most errors were likely to be 
unintentional. See go.nature. 
com/nc69zw for more. 


Snow survey starts 
The Solid Precipitation 
Intercomparison Experiment, 
an international two-year 
project to measure the depth 
of snow and the amount of 
snowfall at 15 locations around 
the world, kicks off this week. 
See page 312 for more. 


Ethics code 


Members of the American 
Anthropological Association 
overwhelmingly approved 

new ethical guidelines for 
research, the professional body 
announced on 7 November. 
The rewrite was prompted 

by a controversy over 
anthropologists’ participation 


The news in brief 


Killer fungus claims UK ash trees 


‘Ash dieback is likely to kill nearly all of the 
United Kingdom's 90 million ash trees, despite 
the announcement ofa plan to control the 
fungal disease on 9 November. Diseased trees in 
nurseries will be destroyed, and an import ban 


in US military efforts in 
Iraq and Afghanistan (see 
go.nature.com/so2doa). The 
new code is rooted in more 
flexible principles, such as 
avoiding harm to research 
subjects and obtaining their 
informed consent, and not 
the hard-and-fast rules that 
the previous ethics code had 
included. 


Berlin’s biomedicine 


Berlin’s mayor Klaus 
Wowereit announced a deal 
on 6 November to form the 
Berlin Institute of Health, 
which will receive more than 
€300 million (US$380 million) 
in extra funding over the 

next five years, 90% of it from 


306 | NATURE | VOL 491 | 15 NOVEMBER 2012 
© 2012 Macmillan Publishers Limited. All rights reserved 


federal budgets. Scientists 
hope that the new centre, 
which will ally the Charité 
university clinic with the 

Max Delbriick Center for 
Molecular Medicine, can rival 
research powerhouses in the 
United States and Britain. See 
page 317 for more. 


Open-access funds 
Britain’s research-funding 
agencies will spend more than 
£100 million (US$159 million) 
over the next five years to pay 
for work that they funded to 
be free to read, the agencies 
announced on 8 November. 
The grants will come out of 
the United Kingdom's science 
budget and will be awarded 

to universities and other 
research institutions. Research 


will stay in place. The fungus (Hymenoscyphus 
pseudoalbidus) was initially misidentified by 
mycologists, making it hard for the European 
Union to control its spread across the continent. 
See go.nature.com/anzy6u for more. 


Councils UK (an umbrella 
body for seven agencies that 
spend a total of £2.8 billion a 
year) has said that from April 
2013, all work funded by the 
agencies must be published in 
an open-access format — but 
the grants for next year are 
sufficient to cover only 45% 
of papers. See go.nature.com/ 
d28egt for more. 


Fukushima clean-up 
The company responsible for 
decontaminating the ruined 
Fukushima Daiichi nuclear 
power plant in Japan says that 
the cost of doing so will soar. 
The Fukushima plant suffered 
meltdowns in three reactors 
on 11 March 2011 after being 
stricken by an earthquake and 
tsunami. On 7 November, 
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the Tokyo Electric Power 
Company, which owns the 
plant, announced that cleaning 
up the ruined reactors and 
surrounding countryside 
could cost ¥10 trillion 
(US$126 billion) — double the 
size of the clean-up fund set 
aside by the government. 


Space drive 


The British contribution to 
the European Space Agency 
will rise by £60 million 
(US$93 million) per year 

to £240 million, the UK 
chancellor George Osborne 
said in a speech at the Royal 
Society in London on 

9 November. He also named 
eight technological areas in 
which the United Kingdom 
could be a world leader, 
including satellites and space, 
synthetic biology, advanced 
materials and regenerative 
medicine. See go.nature.com/ 
mcyniw for more. 


Fat tax abandoned 
Denmark has abolished a 

tax on high-fat foods, one 

year into its controversial 
attempt to make its population 
healthier. The tax, which 
added 16 kroner (US$2.70) for 
every kilogram of saturated 

fat in high-fat products, drove 
up food prices and put jobs 

at risk, the Danish Ministry 

of Taxation said. “We have 

to try improving the public 
health by other means,’ said 
food minister Mette Gjerskov. 


TREND WATCH | 


Subsidies to lower the price of 


renewable energies, such as solar 


and wind, rose 24% on 2010 
values to reach US$88 billion 
in 2011, according to the 


International Energy Agency's 


World Energy Outlook 2012, 
released on 12 November. But 
that support was less than a 
sixth of the $523 billion used 
to artificially reduce the cost 
of oil, natural gas and coal last 


year. Fossil-fuel subsidies largely 
track the price of crude oil. See 
go.nature.com/i5jtlv for more on 


this year’s energy outlook. 


The country has also rejected 
a proposed tax on high-sugar 
foods. See go.nature.com/ 
hkgyts for more. 


Advice at risk 

A letter calling for the budget 
of the UK Parliamentary Office 
of Science and Technology 
(POST) to be protected 

from cuts was published 

on 7 November, signed by 

two former British science 
ministers, a Nobel laureate and 
many UK learned societies. 
POST, which provides 
politicians with analyses of 
scientific issues, could face cuts 
of up to £98,000 (US$156,000), 
or 17% of its total budget. See 
go.nature.com/p55ezf for more. 


PEOPLE 
Palaeontologist dies 


Farish Jenkins, a 
palaeontologist at Harvard 
University in Cambridge, 
Massachusetts, died on 

11 November, aged 72. 
Jenkins was a noted science 
communicator and lecturer. 
His discovery with colleagues 
of the fossil remains of 
Tiktaalik roseae — a four- 
legged, fish-like creature 

that seemed to show the 
evolutionary transition of 
vertebrates from water to land 
— appeared on the 6 April 
2006 cover of Nature and 
sparked a media frenzy. See 
go.nature.com/y5ehem for 
more. 


Energetic farewell 
Hu Jintao, China's outgoing 
president, said that protecting 
the environment must be a top 
priority for the country’s next 
government. In his opening 
speech to the 18th National 
Congress in Beijing on 

8 November, he told delegates 
that China must set a ceiling 
on its energy consumption. 
Hu (pictured) is likely to be 
succeeded this week as the 
head of China’s Communist 
Party by Xi Jinping, who will 
then become the country’s 
president in March 2013. 


Vaccine tribulations 


A large phase III clinical 

trial of a malaria vaccine 
candidate, RTS,S/ASO1, 

has reported disappointing 
results in infants who received 
their first injection between 

6 and 12 weeks of age. The 
vaccine reduced the number 
of episodes of clinical malaria 
over 12 months by arounda 
third for these babies, the key 


FOSSIL-FUEL SUBSIDIES DWARF RENEWABLES AID 


Higher energy prices and rising consumption of oil and gas led 
to a recent surge of subsidies for fossil fuels to US$523 billion. 


Subsidies (US$ billion, nominal prices) 


2007 2008 


2009 


§ Fossil-fuel subsidies 
| Renewables subsidies 
(excluding hydro) 


2010 2011 


SEVEN DAYS | THIS WEEK | 


20-21 NOVEMBER 
The European Space 
Agency council meets 
in Naples, Italy, to put 
together a multi-year 
spending plan. 
Www.esa.int 


22-23 NOVEMBER 
Ata European 

Council meeting in 
Brussels, heads of state 
negotiate the European 
Unions budget for 
2014-20, including 

the final amount 
apportioned to research. 
Around €80 billion 
(US$100 billion) has 
been proposed for the 
research framework 
(see Nature 489, 
188-189; 2012). 
go.nature.com/d7rq5u 


age group targeted by the trial. 
The protection is lower than 
that observed in a previous, 
smaller phase II trial of babies 
in the same age group and less 
than the 55% reported last year 
ina group of children who were 
vaccinated at between 5 and 17 
months of age. See go.nature. 
com/gmw9ib for more. 


Sequencing stir 
Genetic-testing company 
23andMe, based in Mountain 
View, California, will make 
anonymized customer data 
available to researchers, it 
said at the annual meeting 
of the American Society 

of Human Genetics in San 
Francisco, California, last 
week. At the same event, 
Pacific Biosciences of Menlo 
Park, California, announced 
technology improvements 
that allow genomic sections 
at least 5,000 base pairs long 
to be sequenced in a single 
read, compared with a few 
hundred base pairs for other 
technologies. See go.nature. 
com/jqfr3k for more. 


> NATURE.COM 
For daily news updates see: 
Www.nature.com/news 


15 NOVEMBER 2012 | VOL 491 | NATURE | 307 
© 2012 Macmillan Publishers Limited. All rights reserved 


NEWSIN FOCUS 


Doubts Sow’s DNA Creative funding The man who 
greet positive outcome of yields silk purse for for a biomedical challenged fertility 
heart therapy p.313 researchers p.315 §} powerhouse in Berlin p.317 dogma p.318 


Re-elected US President Barack Obama has won four more years in which to cement his legacy. 


Obama reasserts 


research focus 


But ‘fiscal cliff’ threatens science and climate goals. 


BY ERIC HAND, IVAN SEMENIUK, JEFF 
TOLLEFSON AND MEREDITH WADMAN 


S peaking last week in Washington DC, US 


President Barack Obama reminded voters 
of the plan they had effectively endorsed 
by re-electing him. One of his key objectives, 
Obama said, would be to ensure that the United 
States “is a global leader in research and tech- 
nology and clean energy, which will attract new 


companies and high-wage jobs to America”. 

A different objective will be in the spotlight 
this week when Obama visits New York, a 
city still recovering from the damage caused 
by Hurricane Sandy on 29 October. Climate 
change could make storms like Sandy more 
common in the future. And New York’s mayor, 
Michael Bloomberg, cited climate concerns 
when he endorsed Obama for re-election (see 
Nature 491, 167-168; 2012), saying that the 


president has “taken major steps to reduce our = 
carbon consumption, including setting higher & 
fuel-efficiency standards for cars and trucks”. 
When he accepted that endorsement, Obama 
acknowledged that “climate change is a threat 
to our children’s future, and we owe it to them 
to do something about it”. 

Yet this new opportunity to confront 
climate change and invest in science and 
technology comes with towering obstacles. 
The election did not end the polarization 
of Congress — Republicans retained con- 
trol of the House of Representatives, and 
the Democrats only slightly strengthened 
their narrow majority in the Senate. And 
the ‘fiscal cliff’ looms large — automatic tax 
increases and spending cuts, the legacy of 
earlier budget battles, will hit on 2 January 
unless the outgoing Congress finds a way to 
avert them in the session that begins this week 
(see Nature 487, 414-415; 2012). 

The cuts, totalling some US$136 billion, 
would apply to all discretionary spending 
next year, including defence, and would eat 
deeply into federal science budgets (see ‘At 
the precipice’). Congressional leaders expect 
Obama to play an active part in brokering a 
deal to avoid the fiscal cliff, which economists 
say could plunge the fragile US economy back 
into recession. The outcome will foreshadow 
Obama's prospects for achieving other objec- 
tives — including those relevant to science — 
during his second term. 


C. SOMODEVILLA/GETT’ 


CLIMATE OPPORTUNITY 
Obama may have to develop his climate plans 
without some high-profile lieutenants. Energy 
secretary Steven Chu and Environmental 
Protection Agency (EPA) administrator Lisa 
Jackson are both rumoured to be stepping 
down. During Obama’s first term, both 
became lightning rods for Republican attacks 
— Chu for his role in approving a $535-million 
government loan guarantee to Solyndra, a 
solar-energy company that later went bankrupt, 
and Jackson for implementing greenhouse-gas 
regulations. But even without Chu or Jackson, 
the administration’s approach to renewable 
energy and global warming would change very 
little, says Michael Gerrard, director of the 
Center for Climate Change Law at Columbia 
University in New York. “I suspect we would 
have continuity in the broad policy approaches.” 
Obama’s election victory, combined with 
growing alarm in the United States over 
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PEOPLE’S CHOICE 


Science at stake in state proposals 


During last week’s national election, US 
voters also weighed in on state ballot 
measures that affect research. 


Hope for higher education 

California voters approved temporary 

tax increases expected to generate US$6 
billion annually for schools and community 
colleges through 2017, with smaller 
revenues through 2019. Passage of the 
measure also halted imminent state budget 
cuts that would have cost the University 

of California (UC) and the California State 
University system $250 million each over 
one year. UC administrators expressed 
optimism that the measure heralds a more 
favourable fiscal and political climate for the 
institution, which has weathered years of 
state funding cutbacks. 

“Education and research are multi-year 
endeavours,” says Keith Yamamoto, vice- 
chancellor for research at UC San Francisco. 
“The impact of a lack of stable funding is 
substantial.” 

Steve Montiel, a UC spokesman, called 
the measure “a significant step towards the 
prospect of financial stability”. 


Experimentation with marijuana 
Colorado and Washington became the first 
states to legalize marijuana for non-medical 
purposes, permitting purchase and use 
of the drug by adults aged 21 and older. 
Residents of Colorado can also possess up 
to six marijuana plants. 

The new measures set the stage for 
a legal battle: federal law prohibits the 
substance, and the US Drug Enforcement 
Administration says that its policy “remains 
unchanged”. Opponents fear that looser 
marijuana laws could lead to more drug 
abuse in the United States. In studies of 


> severe weather events, such as Hurricane 
Sandy and the severe drought in the Midwest 
this summer, could bolster efforts to curb 
carbon emissions. Jackson laid the founda- 
tion for such reductions after the US Supreme 
Court ruled in 2007 that carbon dioxide is a 
pollutant, which allowed the EPA to regulate 
the gas under the Clean Air Act. Jackson went 
on to craft the first US greenhouse-gas reg- 
ulations for vehicles, and in March proposed 
arule that would effectively ban the construc- 
tion of new coal-fired power plants unless they 
are equipped to capture and sequester roughly 
50% of the carbon dioxide they emit. The 
agency next plans to propose rules for existing 
power plants, then oil refineries. The details 
of those rules are unclear. The EPA could, for 


drug use in the Netherlands since the 
country’s de facto legalization of marijuana 
in 1976, Robert MacCoun, a drug-policy 
expert at the University of California, 
Berkeley, has found only modest increases 
in marijuana use and no significant 
escalation to harder drugs (R. MacCoun and 
P. Reuter, Science 278, 47-52; 1997). 

“Of course, the Netherlands is a different 
country”, but it provides some of the only 
available data worldwide, MacCoun says. 
The Washington measure would direct 
some marijuana tax revenue towards drug- 
abuse research. 


No labelling for genetically modified food 
California voters rejected a measure that 
would have made theirs the first state 

to require labelling of foods containing 
genetically modified organisms (GMOs). 

Bob Goldberg, a plant geneticist at UC 
Los Angeles, who co-authored an argument 
against the proposal in the state’s voting 
guide, calls the measure “ideological and 
not evidence-based”. 

Marion Nestle, a food-politics expert at 
New York University, argues that labelling 
might have promoted greater consumer 
trust in GMOs. “Not having a choice induces 
paranoia in people,” she says. 

A poll conducted by the California 
Business Roundtable, a business-advocacy 
group based in Sacramento, and the School 
of Public Policy at Pepperdine University 
in Malibu, California, found that public 
support for labelling was at around 65% 
in August, then plummeted in the month 
before the election. The opposing campaign 
— backed heavily by Monsanto, a producer 
of genetically modified seeds based in St 
Louis, Missouri — escalated its television 
advertising in the final weeks. Helen Shen 


example, set energy-efficiency standards for 
different types of power plant or take a more 
flexible approach that would let states — which 
normally implement air-quality rules — decide 
how to proceed. 

There is also an outside possibility that 
Congress’s struggle to avoid the fiscal cliff 
could bring another approach to the fore: a 
carbon tax. To avoid the automatic spend- 
ing cuts and tax rises, lawmakers must find 
new ways to raise revenue. So far, the House 
has focused on closing tax loopholes, but 
those efforts are expected to come up short. 
As an alternative, a carbon tax has been 
quietly gaining traction in policy meetings, 
even among some conservatives. 

“A carbon tax is certainly not likely as a 
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starting point, but we think it could become 
extremely attractive if the lawmakers begin to 
run out of options for generating revenue,’ says 
Mark Muro, a senior fellow at the Brookings 
Institution, a non-partisan think tank in 
Washington DC. 

Although the idea of any tax — let alone one 
on carbon — is anathema to most conservatives 
on Capitol Hill, the idea offers something for 
everybody in the current budget crunch. A tax 
would reduce emissions and raise revenue for 
energy research and development — outcomes 
that environmentalists would welcome — and it 
would generate extra revenue that conservative 
lawmakers could use to offset lower tax rates on 
individuals and businesses. 

In August, economists at the Massachusetts 
Institute of Technology (MIT) in Cambridge 
reported that the United States could raise $1.5 
trillion over ten years and reduce emissions to 
14% below 2006 levels by 2020 by instituting 
a tax of $20 per tonne of carbon in 2013 and 
increasing it by 4% a year. Less than 20% of that 
revenue would be enough to fund a massive 
boost in federal investments in clean-energy 
research and development — from $3.8 bil- 
lion in 2012 to $30 billion annually, Muro says. 
The administration has yet to weigh in on the 
idea, and some on Capitol Hill think that is 
just as well. If pushed prematurely, the idea 
could wither in the political spotlight before 
lawmakers have a chance to fully consider 
its merits. “We're trying to generate interest,” 
a senior House aide told Nature. “The more 
discussion there is, the better.” 


FUNDING FEARS 
Basic research has historically fared well under 
Democratic and Republican administrations, 
but many university administrators were 
nonetheless relieved by the election’s outcome. 
They wondered whether Republican candidate 
Mitt Romney would side with other Republi- 
cans — including Paul Ryan, his running mate 
and chairman of the House budget commit- 
tee — who advocate severe spending cuts to 
government programmes as a way of reducing 
the deficit. “Thank God we don't have to find 
out,” says Stewart Smith, dean for research at 
Princeton University in New Jersey. 
Universities that receive grant money from 
federal science agencies are nonetheless bracing 
themselves for the fiscal cliff that awaits if 
Congress cannot reach a budget deal or 
find a way to extend the bargaining window 
before 2 January. The 
administration's Office 
of Management and 
Budget estimates that 
most funding agencies 


* 


would have their budgets 
> NATURE.COM slashed by 8.2% in the 
Visit Nature's absence of an agreement. 
election special: Claude Canizares, vice- 
www.nature.com/ president for research at 
election2012 MIT, says that the result 
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US Environmental Protection Agency administrator Lisa Jackson (left) and energy secretary Steven Chu (right) drew much Republican ire in 2008-12. 


would bea loss of $40 million in research rev- 
enue at the university, which drew $619 million 
in research support in 2011. An 8% cut in fund- 
ing to the US Department of Energy (DOE) 
would mean an 8% loss in staff at the DOE 
Princeton Plasma Physics Laboratory, of which 
Smith will become vice-president in January. 
Advocates for biomedical research are 
equally concerned about the prospect of the 
cuts. “I don’t know how you spare anyone 
or anything” at the US National Institutes of 
Health (NIH), says Jennifer Zeitzer, direc- 
tor of legislative affairs at the Federation of 
American Societies for Experimental Biology 
in Bethesda, Maryland. The automatic cuts 
would slash the agency's $30.7-billion budget 
by $2.5 billion, a portion of which would be 
exacted from every NIH institute and centre. 
That prospect, in combination with other 
pressures on medical research budgets, is 
“chilling”, says Ann Bonham, chief scientific 
officer at the Association of American Medical 
Colleges in Washington DC. She notes that the 
Patient Protection and Affordable Care Act, 
the health-care-reform law that is the signature 
policy achievement of Obama’s first term and 
will ultimately extend health insurance to more 
than 30 million now-uninsured US citizens, 
also mandates a $155-billion cut in govern- 
ment payments to hospitals. That could hurt 
the large teaching hospitals that support much 
US medical research (see Nature 487, 13-14; 
2012). Bonham worries that the confluence of 
stresses will consign the biomedical research 
enterprise to “death by 1,000 cuts”. 
Supporters of Obama’ health-care reform 
argue that it will eventually curb soaring US 


> 


MORE 
ONLINE 


government outlays for health care, thereby 
shrinking the deficit and producing more 
revenue for agencies, including the NIH. If the 
plan works, the reform is “likely to be a long- 
term gain for research’, says Ezekiel Emanuel, 
a medical ethicist and health-policy expert at 
the University of Pennsylvania in Philadelphia, 
who was the senior health-policy adviser to 
the White House budget office from 2009-11. 


AT THE PRECIPICE 


Annual US funding for non-military research and 
development will drop sharply if government-wide 
budget cuts take effect (red). If defence spending 
is spared, even deeper cuts will apply (blue). 
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Science could also benefit from another Obama 
goal: immigration reform. One result of such 
reform could be more ‘H-1B visas for foreign 
scientists and engineers, a need that both candi- 
dates emphasized during the campaign. 

In the meantime, Obama's science agenda 
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will depend on cooperation with a Congress 
that includes some new faces. Last week’s 
general election not only returned him to office 
and decided state ballot measures (see ‘People’s 
choice’); it also marked the end of some key 
lawmakers’ terms. The chairmanship of the 
Senate energy and water committee is being 
vacated by the retiring Jeff Bingaman, a Demo- 
crat from New Mexico. Some observers expect 
Ron Wyden, a Democrat from environmen- 
tally friendly Oregon, to fill the spot. Although 
generally liberal, Wyden is known for crossing 
the aisle to work with Republicans. “Wyden is 
a very thoughtful guy; he likes to think outside 
the box,” says Michael Lubell, director of pub- 
lic affairs for the American Physical Society in 
Washington DC. 

In the House, it’s unclear who will chair 
the spending subcommittee that funds the 
NIH. The current chairman, Denny Rehberg 
of Montana, relinquished his House seat in a 
failed bid for the Senate. The science, space 
and technology committee is losing its chair- 
man, Ralph Hall of Texas, who is stepping 
down in accordance with Republican term 
limits. Jim Sensenbrenner of Wisconsin, Dana 
Rohrabacher of California and Lamar Smith 
of Texas all want the job. Lubell says that his 
money is on Smith, a co-sponsor of patent- 
reform legislation who has also tried to make 
it easier for foreign graduates with science 
degrees to remain in the United States. 

For now, Obama’s victory has created an 
opening for compromise after two years of 
Congressional gridlock. With the fiscal clock 
ticking, the coming weeks may well set the tone 
for the next four years. m SEE EDITORIALP. 30! 
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Kasprowy Wierch, a summit in Poland, will soon get new snow-measuring instruments. 


Snow survey hopes 
for avalanche of data 


More accurate snowfall measurements could improve 
climate models and estimates of water resources. 


BY JANE QIU 


ountains are barometers of climate 
Meee but some of the simplest 

questions about them are the hard- 
est to answer. How much snow coats their 
peaks and slopes, for example? And how do 
these frosty shrouds alter from year to year? 
This week, an international programme kicks 
off to answer those questions. 

Ina two-year project called the Solid Precip- 
itation Intercomparison Experiment (SPICE), 
spearheaded by the World Meteorological 
Organization (WMO), climate scientists will 
deploy a suite of state-of-the-art snow gauges 
at 15 sites in geographically and climati- 
cally diverse countries around the world, up 


to 4,318 metres above sea level (see “White 
noise’). The goal is to make accurate measure- 
ments of snow depth and snowfall — the most 
fragile form of precipitation, which can elude 
or clog simple collecting devices — and come 
up with recommendations for the best ways to 
do snow surveys in different parts of the world. 
The results could improve climate models and 
help to predict permafrost stability, ecosystem 
changes and the availability of water resources 
in the coming decades. 

“Snowfall is an important part of the global 
hydrological cycle,” says Roger Atkinson, 
acting head of the WMO’s Instruments and 
Methods of Observation Programme in 
Geneva, Switzerland. “If we can’t accurately 
measure the amount of snowfall, then we wont 
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be able to know how much water we have and 
how it will change in the future.” 

Snowfall also “partly determines whether 
a glacier grows or retreats,” says Zhang 
Yinsheng, a climate scientist at the Chinese 
Academy of Sciences’ Institute of Tibetan 
Plateau Research in Beijing, who is not 
involved in the SPICE project. “People have 
been debating the fate of Himalayan glaciers 
for a long time, but we don't have a proper 
grasp of even the basics.” 

Although researchers can accurately assess 
some climate parameters such as temperature, 
pressure, wind speed and humidity, measur- 
ing snowfall remains challenging. Snowflakes 
are light and drift with the wind, and weather 
conditions can dramatically affect what pro- 
portion of snowfall is actually captured by 
researchers’ cylindrical metal gauges, says 
Rodica Nitu, a meteorological instrument 
expert at Environment Canada in Toronto who 
leads the project. And when the temperature is 
around freezing, the damp snow tends to stick 
to the rim of the container, soon forming acap 
and preventing further collection. 

“Undercatch is a serious problem’, says 
Roy Rasmussen, a climate scientist at the US 
National Center for Atmospheric Research in 
Boulder, Colorado. This is particularly the case 
with automatic gauges, which can capture as 
little as 20% of the actual snowfall, he says. 

Unreliable snow readings introduce one 
of the greatest unknowns to climate models, 
hampering the ability to predict future changes 
in water resources and mountain hazards, says 
Rasmussen. And snowfall, like other forms of 
precipitation, is expected to increase as the 
globe warms. Better snow data could help 
modellers to predict the increase in snowfall, 
and whether it will be sufficient to offset the 
increased melting of glaciers. 

The last major international effort to meas- 
ure snow was more than 20 years ago, and 
“there has been a lot of progress since then’, 
says Rasmussen. One of the main aims of the 
project is to test the range of recently devel- 
oped sensors, gauges and windshields. For 
example, field observations show that shields 
to reduce the horizontal wind speed above the 
gauge increase collection enormously. “It’s 
the most important factor for accurate snow 
measurements,’ says Rasmussen. 

New ways of heating the measuring gauges 
should also prevent snow capping without 
causing evaporation or air turbulence that 
blows the snow away. The field is also switch- 
ing from manual to automated instruments, 
enabling continuous measurements over large, 
hard-to-access areas. Relating the two data sets 
will make records of snow measurements con- 
tinuous over time, says Nitu. 

Zhang says that the project is timely and 
important, but that it misses crucial regions 
such as the Himalayas, where SPICE doesn’t 
have a testing site. Early next year, Zhang 
and his colleagues will set up a network of 
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stations across the Tibetan plateau and sur- 
rounding mountain ranges, at altitudes up to 
6,000 metres, which will gather accurate snow 
measurements across the region that could 
augment SPICE'’s results. 

In the longer term, however, “there will 
never be enough ground measurements 
to cover an entire mountain’, says Michael 
Lehning, a climate scientist at the Swiss Federal 
Institute for Snow and Avalanche Research in 
Davos-Dorf, who is involved in the project. 
Results from SPICE will be used to calibrate 
airborne and satellite-based sensors, which use 
techniques such as microwave, radar and laser 
ranging to survey much larger areas. 

“The idea is to push remote sensing to be 
accurate enough for use in mountains,” says 
Lehning. “It’s still a long way off, but SPICE is 
a good starting point” = 
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The SPICE project aims to test the most advanced techniques 


for measuring snowfall at sites around the world. 
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ALTERNATIVE MEDICINE 


Chelation-therapy 
heart trial draws fire 


Critics not persuaded that metal-snaring treatment works. 


BY EWEN CALLAWAY 


ith millions of Americans regularly 
using complementary medicines, 
researchers usually applaud efforts 


to test and debunk folk treatments such as 
echinacea, a herbal supplement often deployed 
against the common cold. But what ifa trial 
shows that an alternative therapy might work? 
That is the case for a study funded by the 
US National Center for Complementary 
and Alternative Medicine (NCCAM), part 
of the National Institutes of Health (NIH) 
in Bethesda, Maryland. The trial hints that 
a fringe therapy intended to sop up metal 
ions in the blood might reduce participants’ 
risk of heart attack. Critics are attacking 
both the rigour of the study and the records 
of some of its investigators, complicating the 
NCCAM‘s efforts to answer charges from some 
researchers that it funds quackery, and raising 
questions about whether the centre’s US$128- 
million annual budget is being spent wisely. 
The Trial to Assess Chelation Therapy 
(TACT) was a 10-year, $31.6-million study 
involving 1,708 participants at 134 centres. It 
aimed to test whether weekly infusions ofa salt 
of ethylenediaminetetraacetic acid (EDTA) can 
lower the risk of repeat heart attacks. EDTA is 
a chelating agent: the molecule is peppered 
with electron-rich nitrogen and oxygen atoms, 
which can grab and hold onto positive metal 
ions (see picture). The US Food and Drug 


Administration has approved one salt, calcium 
disodium EDTA, to treat lead poisoning. 
Proponents of chelation therapy for heart 
disease initially speculated that EDTA could 
also cleanse the blood of calcium ions, a com- 
ponent of the atherosclerotic plaques that 
block blood vessels. But evidence against that 
hypothesis led them to suggest alternative 
mechanisms, for example that the molecule 
captures other metals, preventing heart-dam- 
aging inflammation. In spite of the uncertainty, 
the treatment is already big business: a 2007 US 
government survey estimated that, every year, 
110,000 Americans undergo chelation therapy, 


The electron-rich oxygen (red) and nitrogen (dark 
blue) atoms in ethylenediaminetetraacetic acid 
can grab and hold onto positive metal ions (green). 
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which can cost thousands of dollars per course. 

According to TACT, which the NCCAM co- 
funded along with the National Heart, Lung, 
and Blood Institute (NHLBI), the therapy 
shows signs of working. On 4 November at the 
annual meeting of the American Heart Asso- 
ciation in Los Angeles, California, trial lead- 
ers reported that 26% of patients who received 
infusions of disodium EDTA went on to suffer 
a heart attack, stroke or other heart problem, 
compared with 30% of patients on a placebo — 
a statistically significant difference. 

Many medical researchers were quick to 
question the results. Perplexingly, the benefit 
was observed only among participants with 
diabetes, and 30% of participants dropped out 
of the trial, undermining comparison between 
the treatment and placebo. Critics also note 
that nearly two dozen trial co-investigators 
have been disciplined by state medical boards 
for infractions ranging from insurance fraud 
to providing ineffective treatments. “They 
offer aromatherapy, crystal therapy and every 
imaginable wacky form of medicine. You can't 
do high-quality research at sites like that,” says 
Steven Nissen, a cardiologist at the Cleveland 
Clinic in Ohio. “We wasted $30 million and 
10 years on an unreliable study.’ 

He worries that the research will be used 
to support unapproved chelation therapies, 
which have been linked to heart attacks and 
death. “Public harm is going to come out of 
this. People are going to get bilked out of a lot 
of money. People are going to die” 

Kimball Atwood, an anaesthesiologist at 
Tufts University School of Medicine in Boston, 
Massachusetts, and one of TACT’s most vocif- 
erous critics, argues that the trial has been trou- 
bled from the beginning. In a paper titled “Why 
the NIH Trial to Assess Chelation Therapy 
(TACT) should be abandoned’ (K. C. Atwood 
et al. Medscape J. Med. 10, 115; 2008), he 
claimed that trial proponents had mischarac- 
terized earlier studies of chelation therapy > 
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stations across the Tibetan plateau and sur- 
rounding mountain ranges, at altitudes up to 
6,000 metres, which will gather accurate snow 
measurements across the region that could 
augment SPICE'’s results. 

In the longer term, however, “there will 
never be enough ground measurements 
to cover an entire mountain’, says Michael 
Lehning, a climate scientist at the Swiss Federal 
Institute for Snow and Avalanche Research in 
Davos-Dorf, who is involved in the project. 
Results from SPICE will be used to calibrate 
airborne and satellite-based sensors, which use 
techniques such as microwave, radar and laser 
ranging to survey much larger areas. 

“The idea is to push remote sensing to be 
accurate enough for use in mountains,” says 
Lehning. “It’s still a long way off, but SPICE is 
a good starting point” = 
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The SPICE project aims to test the most advanced techniques 


for measuring snowfall at sites around the world. 
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Chelation-therapy 
heart trial draws fire 


Critics not persuaded that metal-snaring treatment works. 
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a fringe therapy intended to sop up metal 
ions in the blood might reduce participants’ 
risk of heart attack. Critics are attacking 
both the rigour of the study and the records 
of some of its investigators, complicating the 
NCCAM‘s efforts to answer charges from some 
researchers that it funds quackery, and raising 
questions about whether the centre’s US$128- 
million annual budget is being spent wisely. 
The Trial to Assess Chelation Therapy 
(TACT) was a 10-year, $31.6-million study 
involving 1,708 participants at 134 centres. It 
aimed to test whether weekly infusions ofa salt 
of ethylenediaminetetraacetic acid (EDTA) can 
lower the risk of repeat heart attacks. EDTA is 
a chelating agent: the molecule is peppered 
with electron-rich nitrogen and oxygen atoms, 
which can grab and hold onto positive metal 
ions (see picture). The US Food and Drug 


Administration has approved one salt, calcium 
disodium EDTA, to treat lead poisoning. 
Proponents of chelation therapy for heart 
disease initially speculated that EDTA could 
also cleanse the blood of calcium ions, a com- 
ponent of the atherosclerotic plaques that 
block blood vessels. But evidence against that 
hypothesis led them to suggest alternative 
mechanisms, for example that the molecule 
captures other metals, preventing heart-dam- 
aging inflammation. In spite of the uncertainty, 
the treatment is already big business: a 2007 US 
government survey estimated that, every year, 
110,000 Americans undergo chelation therapy, 
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blue) atoms in ethylenediaminetetraacetic acid 
can grab and hold onto positive metal ions (green). 


which can cost thousands of dollars per course. 
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shows signs of working. On 4 November at the 
annual meeting of the American Heart Asso- 
ciation in Los Angeles, California, trial lead- 
ers reported that 26% of patients who received 
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compared with 30% of patients on a placebo — 
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Many medical researchers were quick to 
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was observed only among participants with 
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of the trial, undermining comparison between 
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that nearly two dozen trial co-investigators 
have been disciplined by state medical boards 
for infractions ranging from insurance fraud 
to providing ineffective treatments. “They 
offer aromatherapy, crystal therapy and every 
imaginable wacky form of medicine. You can't 
do high-quality research at sites like that,” says 
Steven Nissen, a cardiologist at the Cleveland 
Clinic in Ohio. “We wasted $30 million and 
10 years on an unreliable study.’ 

He worries that the research will be used 
to support unapproved chelation therapies, 
which have been linked to heart attacks and 
death. “Public harm is going to come out of 
this. People are going to get bilked out of a lot 
of money. People are going to die” 

Kimball Atwood, an anaesthesiologist at 
Tufts University School of Medicine in Boston, 
Massachusetts, and one of TACT’s most vocif- 
erous critics, argues that the trial has been trou- 
bled from the beginning. In a paper titled “Why 
the NIH Trial to Assess Chelation Therapy 
(TACT) should be abandoned’ (K. C. Atwood 
et al. Medscape J. Med. 10, 115; 2008), he 
claimed that trial proponents had mischarac- 
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> by suggesting that their results were 
equivocal and necessitated a larger follow- 
up. Atwood says that the earlier studies in 
fact found that the treatment was ineffective 
at preventing heart attacks. 

In 2008, TACT was suspended after regu- 
lators learned that subjects were not being 
given calcium disodium EDTA, as implied 
on informed-consent forms — instead, they 
were being infused with the slightly differ- 
ent salt disodium EDTA, for which the FDA 
had revoked approval. The trial resumed 
after consent forms were reworded to 
include warnings, such as “death is a rare 
complication of EDTA infusions”. 

Josephine Briggs, director of the NCCAM, 
declined to comment on TACT until the 
results are published in a journal. The prin- 
cipal investigator, cardiologist Gervasio 
Lamas of Mount Sinai Medical Center in 
Miami Beach, Florida, says that the study's 
findings were a surprise and deserve follow- 
ing up. He adds that the trial consent forms 
were approved by the NIH and multiple 
institutional review boards. Gary Gibbons, 
director of the NHLBI, says that his institute 
stands by the study’s methodology. 

But critics charge that TACT is simply 
the latest example of dubious research 
into unproven therapies supported by the 
NCCAM. Some argue that even high- 
quality studies would have little value, 
because negative results are unlikely to 
sway ardent practitioners. “Show me one 
alternative medication or procedure that 
was studied, found to not work, and was 
abandoned by practitioners. I’m not aware 
of any,” says Steven Novella, a neurologist 
at Yale University in New Haven, Con- 
necticut. Briggs, who previously led the 
NIH’s kidney-disease research, points out 
that echinacea sales fell after an NCCAM- 
funded study showed it was ineffective 
against colds (R. B. Turner et al. N. Engl. J. 
Med. 353, 341-348; 2005). With the centre’s 
research showing that Americans spend 
about $34 billion on alternative medicine 
each year, “we think it’s really important to 
bring some science into this’, she says. 

Briggs adds that the NCCAM'’s critics 
often misrepresent the centre’s research, 
focusing on studies of herbal supplements 
such as lavender oil but ignoring multi- 
million-dollar grants for more-mainstream 
science. Among the largest studies funded 
by the centre this year are a computational 
analysis of the human microbiome and an 
effort to use brain imaging to understand 
and treat chronic back pain. 

Novella and other NCCAM critics 
do praise Briggs for bringing increased 
accountability to the centre, and for boost- 
ing the rigour of the research it funds. But 
“even if you did pristine research under the 
NCCAM,”, says Novella, “it’s what you're 
studying that is the problem”. m 
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Pig geneticists go 
the whole hog 


Genome will benefit farmers and medical researchers. 


BY ALISON ABBOTT 


r | 1. J. Tabasco is something of a porcine 
goddess at the University of Illinois, 
Urbana-Champaign, where her ruddy, 

taxidermied head looks down from the office 

wall of geneticist Lawrence Schook. Now she 
has been immortalized in this week’s Nature’ — 
not by name, but by the letters of her DNA. 

Scientists are salivating. For the past couple 
of decades they have been slowly teasing infor- 
mation from the pig genome, applying it to 
breed healthier and meatier pigs, and to try to 
create more faithful models of human disease. 
This week’s draft sequence of T. J’s genome (see 
page 393), with its detailed annotation — a ‘ref- 
erence genome’ — will speed progress on both 
fronts, and perhaps even allow pigs to be engi- 
neered to provide organs for transplant into 
human patients. “Agriculture in particular will 
benefit fast.” says Alan Archibald of the Roslin 
Institute in Edinburgh, UK, one of the paper’s 
lead authors. “The pig industry has an excel- 
lent track record for rapid adoption of new 
technologies and knowledge:” 

T. J.,a domestic Duroc pig (Sus scrofa domes- 
ticus), was born in Illinois in 2001. The next 
year, Schook and his colleagues generated a 
fibroblast cell line from a small piece of skin 
from her ear and commissioned clones to be 
created from it, so that they could work on ani- 
mals all with the same genome. One set of clones 
was created at the National Swine Resource 
and Research Center (NSRRC) in Columbia, 
Missouri, along with genetically engineered pigs 
with genes added or deleted to mimic human 
diseases.“Making such pigs has got increasingly 
easier as knowledge of the genome increases,” 
says physiologist Randall Prather, a co-director 
of the NSRRC, which is funded by the National 
Institutes of Health (NIH). 

The NIH launched the NSRRC in 2003 to 
encourage research in pig disease models. Pigs 
are more expensive to keep than rodents, and 
they reproduce more slowly. But the similarities 
between pig and human anatomy and physiol- 
ogy can trump the drawbacks. For example, 
their eyes are a similar size, with photorecep- 
tors similarly distributed in the retina. So the 
pig became the first model for retinitis pig- 
mentosa, a cause of blindness. And four years 
ago, researchers created a pig model of cystic 
fibrosis’ that, unlike mouse models, developed 


T. J. Tabasco, 
star of the show. 


symptoms resembling those in humans. 

Geneticist and veterinarian Eckhard Wolf at 
the Ludwig-Maximilian University in Munich, 
Germany, has exploited the similarity between 
the human and pig gastrointestinal system 
and metabolism — like us, pigs will eat almost 
anything and then suffer for it — to develop 
models of diabetes. One pig model carries a 
mutant transgene that limits the effectiveness 
of incretin, a hormone required for normal 
insulin secretion’. Mice with the transgene 
developed unexpectedly severe diabetes, but 
the pigs have a more subtle pre-diabetic con- 
dition that better models the human disease. 
“This shows the importance of using an animal 
with a relevant physiology,” says Wolf. 

Pig models are now being developed for other 
common conditions, including Alzheimer’s 
disease, cancer and muscular dystrophy. This 
work will be enriched by the discovery, reported 
in the genome paper, of 112 gene variants that 
might be involved in human diseases. Knowl- 
edge of the genome is also allowing scientists 
to try to engineer pigs that could be the source 
of organs, including heart and liver, for human 
patients. Pig organs are roughly the right size, 
and researchers hope to create transgenic pigs 
carrying genes that deceive the immune system 
of recipients into not rejecting the transplants. 

Back on the farm, early knowledge about 
the pig genome led to the discovery in 1991 of 
a gene involved in porcine stress syndrome, in 
which the stress of overheating, being moved or 
even having sex causes the animals to die sud- 
denly*. Itthen became possible to test forthe > 
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> geneand select pig stocks free of it. 

Having the full genome should also help 
investigators to breed out susceptibility to por- 
cine reproductive and respiratory syndrome 
(PRRS), a viral disease costing the US pig 
industry US$600 million per year. The PRRS 
Host Genetics Consortium, a network of US 
research groups, has identified a region on one 
chromosome that affects levels of virus in the 
blood during infection’. Archibald, who works 
on PRRS, says that the high-quality genome 


sequence should help investigators zero in on 
the genes responsible. 

But the pig genome is not just about appli- 
cations. Lead co-author Martien Groenen, a 
genome researcher from Wageningen Univer- 
sity in the Netherlands, has resequenced the 
genomes of scores of different strains of wild 
and domestic pigs, and used the information to 
show that the pig was domesticated indepen- 
dently in Asia and Europe. He has also started 
to work out which genes were involved in the 


selection of desired traits — such as a longer 
spine to give more bacon — on different con- 
tinents. “It’s curiosity-driven research, but it 
may also help animal breeders in the future, 
he says. = 
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Investment relief 
for biotech sector 


Public markets provide cash injection for struggling field. 


BY HEIDI LEDFORD 


obert Forrester gets a little giddy when 
R: talks about the day his company 

went public. The otherwise understated 
chief operating officer of Verastem, a small 
biotechnology company developing drugs to 
target cancer stem cells, chuckles and bounces 
in his chair as he recounts key strategic deci- 
sions along the way to the company’s initial 
public offering (IPO) on 26 January, which 
raised US$55 million. 

Until recently, Verastem’s IPO would have 
stood little chance. Few biotech companies 
have braved an IPO in the years since the 
global recession hit, and those that did often 
took a beating in the public markets. Venture 
capitalists began to pull out of the sector. Col- 
leagues scoffed when Forrester told them that 
Verastem, a young company in Cambridge, 
Massachusetts, with no clinical data was going 
public. “Many people said, ‘you've got to be 
kidding” he recalls. 

But the IPO drought may be ending. This 
year has seen 12 biotech IPOs, and others are 
in the pipeline. So far, this has pumped some 
$800 million into the sector, according to 
Renaissance Capital, an [PO-research com- 
pany based in Greenwich, Connecticut. And 
biotech stocks are doing well — the NASDAQ 
Biotech Index has outperformed the NASDAQ 
Composite Index for the past 20 months (see 
‘Bullish on biotech’). “If this trend holds, it 
could be great news for the sector,’ says Josh 
Lerner, who studies venture capital at Harvard 
Business School in Boston, Massachusetts. 

Restoring access to the public markets — 
particularly for young companies that have few 
fund-raising options left — can give companies 


the capital they need to expand research pro- 
grammes, hire more researchers or even just 
survive. It can also grant them access to ‘gener- 
alist’ investors who do not specialize in health 
care. “Public investors who may have been out 
of biotechnology for the past couple of years 
have started to move back in,” says James 
Healy, a general partner at venture-capital firm 
Sofinnova Ventures in Menlo Park, California. 

Observers credit several factors for the 
rising investor confidence in biotech. Large 
pharmaceutical firms eager to restock drug 
pipelines are gobbling up smaller firms at 
high prices. Biotechnology companies have 
celebrated several high-profile successes in 
the past 18 months, with the US Food and 
Drug Administration approving groundbreak- 
ing drugs such as vemurafenib, a genetically 
tailored drug for advanced melanoma whose 
prowess in knocking out tumours is matched 
by its jaw-dropping price tag — more than 


BULLISH ON BIOTECH 


An index fund of biotech companies is 
outperforming the NASDAQ composite index. 
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$50,000 for six months of treatment. 

Investors may also be drawn to the sec- 
tor because of the poor performance of other 
industries, which are suffering more directly 
from the sluggish US economy, says Eric 
Schmidt, an analyst at investment bank Cowen 
and Company in New York. “Biotech earnings 
tend to grow independently of the economy, 
unlike electronics or consumer products,’ says 
Schmidt. “Everybody needs medicine” 

The wave of public investment could help to 
offset the dearth of venture capital. A survey 
released last year by the National Venture Cap- 
ital Association, headquartered in Arlington, 
Virginia, showed that nearly 40% of venture 
capitalists had decreased their investment in 
biotech during the previous three years, put off 
by the long timelines and high risks of drug 
development. Several prominent health-care 
funds have closed altogether. “Biotech is a 
money-eating machine,” Lerner says. “The 
need for capital is so large, and given what's 
happened to venture capital, having alterna- 
tives is important.” 

Nowhere is that need greater than in young 
companies, the riskiest of all biotech invest- 
ments, which have been among the hardest hit 
by the drop in venture funding. Verastem’s IPO 
money advanced the company’s business plan 
by two years; it should begin phase II trials of its 
leading compound by mid-2013, Forrester says. 

But IPOs are not necessarily the answer for 
all struggling biotech ventures, cautions Brian 
Atwood, a managing director of Versant Ven- 
tures, a venture-capital firm in Menlo Park. He 
notes that many of the companies that pulled 
off IPO triumphs this year are unusual in 
some respect. Kythera Biopharmaceuticals of 
Calabasas, California, for example, is particu- 
larly appealing to investors because patients 
will have to pay out of their own pockets for 
its leading product — a fat-fighting injection 
designed to shrink double chins — rather than 
relying on health insurance and its accompa- 
nying cost controls. And Verastem’s Forrester 
can barely utter a sentence without referencing 
the company’s executives and scientific advi- 
sory board: a whos who of Boston's biomedical 
glitterati. Healy agrees: “It’s a higher-quality set 
of companies that have recently gone public 
compared with those that may have gone pub- 
lic five years ago.” m 


SOURCE: NASDAQ 


IN FOCUS | NEWS 


Berlin aims to create 
research powerhouse 


German government finds a creative way to fund universities. 


BY QUIRIN SCHIERMEIER 
CC erlin is poor but sexy,’ Klaus 
Wowereit once famously said of 
Germany’s capital, where he has 
served as mayor since 2001. Now he is hoping 
that more money will buy it a little extra love, 
at least from biomedical researchers. Last 
week, he proudly announced an opulent deal 
to boost the city’s scattered health-research 
base, sweetened with a hefty chunk of federal 
funding. Those in charge of the new Berlin 
Institute of Health (BIH) that was created by 
the deal believe that it could rival research 
powerhouses in the United States and Britain. 

It also offers a model for circumventing 
the country’s long-standing restrictions on 
federal funding of universities, a rule that 
baffles many outsiders. “That's the way for- 
ward, no matter what critics might say,’ says 
Wolfgang Herrmann, president of the Tech- 
nical University of Munich, one of Germany’s 
highest-ranked research universities. 

The BIH will ally the Charité, Berlin’s larg- 
est university clinic, with the Max Delbriick 
Center for Molecular Medicine (MDC), a 
national biomedical research centre. Over 
the next five years, the institute will receive 
more than €300 million (US$380 million) in 
extra funding, 90% of which will come from 
federal budgets, with the remainder coming 
from the city and private sources. From 2018 
onwards, the federal government will perma- 
nently support the BIH with an annual sum 
of €80 million. “That’s a marvellous windfall,” 
says Walter Rosenthal, the MDC’s scientific 
director. “It could help us set up in Berlin 
something equivalent to the Howard Hughes 
Medical Institute in Maryland — it’s a once-in- 
a-lifetime opportunity.” 

The MDC researches basic molecular medi- 
cine, but has little direct access to patients, so 
teaming up with the Charité should help to 
speed its discoveries to the clinic. Under the 
auspices of the BIH, molecular biologists and 
clinical researchers will join forces to tackle a 
spectrum of illnesses, from cancer to cardio- 
vascular disorders, and neurodegenerative dis- 
eases such as Parkinson's and Alzheimer’s. The 
BIH will complement Germany’s six national 
health-research centres, which have narrower 
focuses. Work at the new institute, spread 
across Berlin’s health campuses, could start next 
spring. Research coordinators at the Charité 


and the MDC have put together a preliminary 
research plan, which an international evaluation 
team will review by March 2013. 

A significant portion of the start-up cash, 
says Rosenthal, will be used to equip BIH 
groups with state-of-the art sequencing, mass 
spectroscopy and bio-imaging technologies, 
and to expand the MDC’s bioinformatics 
capacity. The newly equipped labs and secure 
long-term funding should lure biomedical 
researchers from around the world. Over the 
next 8 years, up to 60 collaborative groups 
could be set up, Rosenthal says. 

“Recruitment is the bottleneck when it 
comes to turning Berlin into a health-science 
hub of truly international rank,” says Claus 
Scheidereit, an oncologist who coordinates 
cancer research at the MDC. “If we can get over 
that, we can start to think really big.” 

The BIH is the German government’ latest 
attempt to inject national funds into universi- 
ties, circumventing a highly federalized system 
in which state governments jealously guard 
their responsibility for universities. The gov- 

ernment’s acclaimed 


“It could help us ‘excellence initiative, 
set up in Berlin for example, invited 
something German universi- 
equivalent to ties to compete for 
the Howard federal top-up grants 
Hughes Medical and has so far gen- 


erated thousands of 
new science jobs. 
Collaborations that resemble the BIH, linking 
national research centres and universities, have 
also been forged in Karlsruhe, and between 
Julich and Aachen. 

A proposed amendment to Germany's con- 
stitutional law would allow the federal govern- 
ment to co-finance universities permanently, 
but it is unlikely to win the required two-thirds 
majority in parliament. Its opponents fear that 
the bill would allow Berlin to dictate national 
policy in areas, particularly secondary educa- 
tion, in which the states enjoy near-total control. 

Scientists maintain that Germany’s federal- 
ized university system is hampering the crea- 
tion of national research hubs such as those in 
neighbouring Switzerland. “Why shouldn't the 
Technical University of Munich become Ger- 
many’s federal institute of technology?” says 
Herrmann. “Because the rules dort allow it? 
If it helps our country, we should change them 
sooner rather than later.” m 
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Jonathan Tilly defied decades of dogma 
by suggesting that women can make new 
eggs throughout their lives. Now some of 
his critics are taking a second look. 


onathan Tilly likes to gauge the signifi- 

cance of his work by the hair on the backs 

of his arms. “Look at it standing up,” he 

says, thrusting out his forearm on a 

mid-August afternoon. A reproductive 
biologist at Massachusetts General Hospital 
in Boston, Tilly was explaining a procedure to 
retrieve stem cells from the ovaries of a sterile 
woman. This experiment, he hopes, will help to 
quell criticism of his most controversial claim: 
that ovaries have the potential to make eggs 
indefinitely. This defies the long-held dogma 
that female mammals are born with all the 
oocytes (precursors to eggs) they will ever pro- 
duce, a population that dwindles with age and 
is exhausted at menopause. 

Tilly first challenged that doctrine in 2004, 
in a paper' suggesting that the oocytes in 
mouse ovaries are being replenished by stem 
cells. If properly understood, such cells could 
be harnessed to generate fresh eggs for women 
with fertility problems, or even achieve a goal 
Tilly has been pursuing for 25 years: delaying 
or halting menopause. “The hairs are still up,” 
Tilly says. It “happens every time I think about 
that experiment”. 

He has since published a parade of headline- 
grabbing papers, culminating this year in a 
report’ that he had isolated the elusive stem 
cells from human ovaries and coaxed them to 
develop into bona fide oocytes. But his work 
has been dogged by doubt. Some researchers 
question his methods and reasoning. Others 
have tried, and failed, to repeat his experi- 
ments. Tilly “always makes what I call ‘big sat- 
ellites, something tremendous in the sky,’ says 
molecular biologist Kui Liu at the University 
of Gothenburg in Sweden. “He exaggerates,” 
Liu says, and produces a “big press release”. “A 
few years later, people realize, ‘Oh, not right.” 

Tilly says he has weathered a lot of attacks. 
“When I made the decision to pursue this, 
it was out of pure excitement that we found 
something that could revolutionize the field. 
It never even crossed my mind that it would be 

so negative and so nasty. 


> NATURE.COM And it really is negative 
For a podcast and nasty,” 

about this story, But now the stand-off 
visit: of mistrust, and some- 
go.nature.com/Grjhst times open contempt, 


has taken a strange twist. Two of Tilly’s most 
vociferous critics have become his collabora- 
tors: one serving on the board of advisers at his 
start-up company, OvaScience in Cambridge, 
Massachusetts; the other working directly with 
the stem cells that Tilly had isolated. “These cells 
are doing things in vitro that can really start to 
address scientific problems,” says Evelyn Tel- 
fer, a reproductive biologist at the University 
of Edinburgh, UK, who was doubtful of Tilly’s 
work in the past. “If we are really interested in 
the science ... then this is a great tool” 


A COUNTING PROBLEM 

The ‘no new eggs’ doctrine has a long history. 
In 1951, the influential anatomist Solly Zuck- 
erman, at the University of Birmingham, UK, 
performed an in-depth analysis of evidence 
available at the time. He concluded that none 
of it effectively countered a proposal from 
the 1870s stating that female mammals stop 
producing oocytes after birth’. 

For the first 15 years of his career, Tilly 
focused mainly on programmed cell death, or 
apoptosis, and he was struck by the fact that 
no one had ever quantified the loss of eggs due 
to ovulation and natural oocyte death over 
time. So beginning around 1999, Tilly com- 
mandeered a microscope and mouse ovarian 
tissue in order to count the follicles, the cel- 
lular compartments in which oocytes develop, 
in mice at different ages. He found a mathe- 
matical imbalance: the number of degenerated 
follicles was three times higher than expected 
on the basis of the starting pool. If the mice 
were losing oocytes at this rate, their eggs 
should be depleted far sooner than they actu- 
ally were. Something had to be replacing them, 
he concluded: stem cells were the likely culprit. 

Few were willing to accept the idea. It took 
Tilly two years — and numerous rejections and 
revisions — to get the data published in Nature, 
in 2004. Controversy ensued over his methods 
as well as his conclusions. One critique said, for 
example, that it was “alarming” that Tilly used 
the rate of follicle disappearance in one mouse 
strain to calculate loss for another’. 

Tilly dropped most of his apoptosis work 
and steered his entire lab towards proving 
the existence and functionality of these stem 
cells. “You are sort of standing on the precipice 
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wondering whether or not you should make 
the jump,’ he says. “Getting the 2004 paper 
published was for me the jump, because there 
was no turning back at that point” 

A year later, Tilly reported that he had iden- 
tified the source of these putative cells: bone 
marrow’. When he transplanted either mar- 
row or blood from healthy mouse donors into 
sterile mice, the animals could produce cells 
that looked like oocytes. But he could not yet 
fertilize the resulting eggs and create 
embryos — the true test of an egg stem cell. 

At least six groups challenged the bone- 
marrow finding. In one critique’, a group led 
by Telfer wrote that none of Tilly’s experiments 
had successfully been replicated, and that the 
results could be interpreted in other ways. Crit- 
ics also asserted that Tilly was overreaching, 
particularly in media interviews. 

In The Boston Globe in 2005, for example, 
Tilly is quoted as saying: “They're your own 
cells; you don't need anybody’s approval. They 
go right into your blood supply and go right 
to your ovaries, where they mature into eggs.” 
David Albertini, a reproductive biologist at the 
University of Kansas Medical Center in Kan- 
sas City, calls such claims outrageous: “A lot 
of us reproductive biologists feel that this is a 
frank travesty that has falsely raised the hopes 
of many women.” 

Tilly defended his comments and challenged 
his peers to go back to their labs and reproduce 
his experiments. Several did. In 2006, stem-cell 
biologist Amy Wagers at Harvard University in 
Cambridge, Massachusetts, and her collabora- 
tors stitched together the circulatory systems 
of two mice’. One, the donor, expressed green 
fluorescent protein (GFP) in its cells. The other 
did not. The scientists found that although 
green, glowing, blood-borne cells could infil- 
trate the ovaries of the recipient mice, these 
cells acted like blood cells, not oocytes. 

Tilly, in response, performed a similar 
experiment, showing that mice sterilized by 
chemotherapy could give birth after a bone- 
marrow transplant’. But the babies did not 
express GFP, indicating that the eggs from 
which they were derived came from the recipi- 
ent, not the donor. Tilly argued that the bone 
marrow either protected existing oocytes or 
revived oocyte formation, but critics argued 
that the chemotherapy probably didn't kill off 
all the recipient’s oocytes in the first place. 


SHANGHAI SURPRISE 

With little independent replication of his 
work, Tilly was standing alone through much 
of the fray. Then, in 2009, Ji Wu at Shanghai 
Jiao Tong University in China and her col- 
leagues reported that they had isolated from 
mice what she called “female germline stem 
cells” — not from bone marrow, but from ovar- 
ian tissue’. When her team transplanted the 
cells into chemotherapy-treated female mice, 
they developed into mature oocytes, then fer- 
tilizable eggs and, the clincher, healthy pups. 
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Although previously sceptical of the pros- 
pect, Telfer says that when she read Wu's paper 
in 2009 she paused, thinking “there must be 
something in this”. She had met Tilly at the bar 
during a scientific meeting the year before and 
they talked about their differences. After Wu's 
paper, the two co-authored a commentary that 
articulated something like a truce’®. “Although 
these findings do not establish that oogen- 
esis occurs in adult females under physiologi- 
cal conditions,’ Tilly and Telfer wrote, “they 
strongly support the existence of [germline stem 


adds that Liu’s group “didn’t use our protocol 
of isolating the cells. So how to compare?” 

In fact, Tilly says that his lab had trouble 
repeating Wu's protocol, too. Eventually, his 
team retrieved cells but “found consistent 
oocyte contaminatiom” He had to modify the 
protocol to retrieve the mouse and human 
oogonial stem cells, and they differed in size 
from those Wu had isolated. Wu says that her 
cells and Tilly’s are probably “subtypes” of each 
other and that there is still “a lot of work to do” 
to figure out exactly how they are related. 


“There are a lot of people struggling to 
understand how this can possibly work.” 


cells] in adult mouse ovaries. If equivalent cells 
can be found in human ovaries, stem-cell-based 
rejuvenation of the oocyte reserve in ovaries on 
the verge of failure may one day be realized” 

Many set out to replicate Wu’s results, 
including Tilly. And in February, he reported 
the isolation of what he called “oogonial stem 
cells” from human ovaries’. By injecting the 
cells into human ovarian tissue transplanted 
into mice, he was able to generate both folli- 
cles and what seemed to be mature oocytes (see 
Nature 483, 16; 2012). 

“So now we have two different labs, using 
conceptually a similar protocol, and both 
groups got confirmatory data,” he says. “We felt 
at that point there should be no more debate,” 

But there was. Critics soon began point- 
ing out a problem shared by both the teams’ 
approaches. Each identify their respective 
stem cells using antibodies meant to bind a 
cell-surface protein, a common technique in 
cell biology. But the protein they target, called 
vasa, normally sits inside the cell, not on its 
surface. “There are a lot of people in the field 
struggling to understand how this can possibly 
work,’ says Patricia Hunt, a reproductive biolo- 
gist at Washington State University in Pullman. 

Tilly says that although mature oocytes do 
not express vasa on their surface, his cells — 
which are a cross between embryonic precur- 
sors and full-blown oocytes — do. Vasa, he 
says, becomes non-detectable on the cell sur- 
face as the cells mature into eggs. But, he adds, 
“We don't have any proof of that yet.” 

Liu in Sweden says that he initially believed 
Wu's paper when it came out. But his group 
could not repeat the technique. To bypass the 
cell-surface problem with vasa, Liu used an 
approach that tracks the protein inside the 
cells''. He was able to extract ovarian, vasa- 
expressing cells, but none of them underwent 
division — a major criterion for stem cells. 

Wu contends that her cell-isolation tech- 
nique is not easy to perform and invites 
scientists to come to her lab to learn it. She 


Telfer, meanwhile, has begun to collabo- 
rate with Tilly. After going to Boston in 2011 
to observe his human stem cells, she was 
impressed, and tooka sample back to Scotland. 
Her team had worked outa culture system using 
mouse and cow tissues to grow egg precursor 
cells into fertilizable eggs entirely outside the 
body. With Tilly’s cells she needed to adapt the 
technique for use in humans. “The first experi- 
ments blew me away, just blew me away,’ she 
says. Tilly’s cells, she found, grew rapidly into 
oocyte-like structures. “T spent the whole night 
trying to find another explanation other than 
new follicles had formed} she says. “And I could 
not come up with one” 

Telfer has applied for permission from the 
UK Human Fertilisation and Embryology 
Authority to attempt to fertilize the cells; such 
experiments are forbidden using US federal 
funding. If successful, the technique to make 
fertilizable human eggs outside the body could 
eventually be disseminated to fertility clinics 
throughout the world. 


BOUNDARIES AND BACKLASH 
The cells are also being used at OvaScience, 
which was founded in April 2011 and has 
secured US$48 million in venture capital. The 
company is exploiting Tilly’s cells in several 
ways. One aim is to rejuvenate egg cells from 
older women by adding fresh cytoplasm and 
mitochondria. The research builds on a con- 
troversial experimental fertility technique in 
which egg cells are injected with cytoplasm 
from another woman's eggs. The OvaScience 
approach would use mitochondria extracted 
from the mother’s own oogonial stem cells, 
which Tilly says would be healthier than those 
from an ageing mother’s eggs, and should skirt 
some of the ethical and safety questions raised 
by using donor mitochondria. OvaScience 
plans to begin clinical trials this year in collab- 
oration with two Boston-based fertility clinics. 
Tilly's oogonial stem cells will also serve as a 
screening tool for new drugs that might block 
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or boost egg production. Such drugs might 
help reverse infertility or even help delay or halt 
menopause. Albertini still worries that such 
claims inflate hopes but, like Telfer, he is try- 
ing to keep an open mind. The prospect of new 
models for screening fertility drugs convinced 
him to join OvaScience’s scientific advisory 
board. “There are a lot of things that I know 
and I do that could be helpful to them,” he says. 

The company is swiftly moving forward, 
steered by the team that guided Sirtris, based 
in Cambridge, Massachusetts, a biotech firm 
focused on anti-ageing therapies. In fact, it 
was a collaboration between Tilly and Sirtris’s 
founder, David Sinclair, at Harvard Medical 
School in Boston, that sparked the launch of 
OvaScience. 

He and Tilly are “mutual admirers’, Sinclair 
says, explaining that they joined forces in 2009 
to explore the idea that egg quality declines 
with age because older eggs lack enough energy 
to support fertilization. Sinclair offers his anti- 
ageing expertise and his experience of contro- 
versy; some of the initial results on which Sirtris 
was founded could not be replicated and have 
been a source of contention in the field (see 
Nature 464, 480-481; 2010). 

“It is an interesting team that Jon and I 
make,” Sinclair says, “because the two of us 
push the boundaries of science. And both of 
us have encountered backlashes in doing so.” 

Still, despite his characteristic gumption 
and ebullience, Tilly seems to be burdened by 
the continual sparring. Although he’s shifted 
much of his time to studying oogonial stem 
cells in the ovary, he still maintains that bone- 
marrow stem cells might also create new eggs. 
His critics disagree, and even if they accept the 
existence of oogonial stem cells, they still ques- 
tion whether such cells normally function to 
produce new eggs. 

“The data provided so far don't support this 
concept,” says Albertini. Tilly maintains that 
these stem cells must be doing something in 
the body. But in exasperation, he is willing to 
concede that it may not matter in the clinic. 
“If you could take these cells outside the body, 
and get them to make a functional egg that can 
make a normal healthy baby, what do you care 
about the physiology?” = 


Trisha Gura is a freelance writer in Boston, 
Massachusetts. 
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UANTUM LEAPS 


Fully fledged quantum computers are still a long way off. But devices 
that can simulate quantum systems are proving uniquely useful. 


BY GEOFF BRUMFIEL 


hen high-energy physicists announced in July that they 
had found the long-sought Higgs boson — their biggest 
find in decades — the thousands of individuals involved 
rightly held their heads high. But in some sense, they had 
already been beaten to the prize. 

Months earlier, a team of nine physicists had taken a rarefied 
vapour of rubidium-87 atoms, cooled it down to very near absolute 
zero and used lasers to arrange the atoms into a tiny grid. The physi- 
cists then tweaked the temperature until the atoms neared a critical 
‘phase transition — a point between two different behaviours, such 
as liquid water and solid ice. Monitoring their grid in this in-between 
region, the researchers saw an unusual wave of energy that appeared 
momentarily and then died away’. Mathematically speaking, this 
behaviour was the same as the appearance and decay of a Higgs 
particle inside a particle collider. 

“Obviously, it’s not at all the Higgs particle,” says Immanuel Bloch, the 
researcher who led the study at the Max Planck Institute for Quantum 
Optics in Garching, Germany. If nothing else, this particle moved 
in only two dimensions, whereas the Higgs moves in three. But the 
experiment is still helpful for particle physicists, says Bloch, because 
it gives them a new way to explore and test the complex quantum field 
theories that underlie the Higgs. 

This experiment also put Bloch and his team at the vanguard of 
the rapidly growing field known as quantum simulation. The idea, 
broadly speaking, is to use orderly systems such as a grid of atoms to 
model much more complicated things — new particles, for example, 
or high-temperature superconductors. The behaviour of such systems 
cannot be derived by hand, and even the world’s fastest super- 
computers can’t model them. 

Quantum simulators are the lesser sibling of an idea in physics 
known as quantum computers, which have been touted for more than 
three decades as a way to do everything from complex modelling to 
code-breaking. What the simulators and computers share is an ability 
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to operate by the rules of quantum mechanics. Where they differ is 
in computational power: quantum computers are general-purpose 
machines able to carry out any possible algorithm, whereas quantum 
simulators have to be tailored specifically for the problem at hand. 
Current-generation simulators are also tough to control, and they 
may not be able to tackle every problem. Nevertheless, the simulators 
are much easier to build than quantum computers. And researchers 
say that the devices will soon be able to solve at least some quantum 
problems that can't be tackled in any other way. 


NUTS AND BOLTS 

The world of quantum physics is full of theorems, but one goes 
unwritten: if you want to get noticed, show that your idea came from 
Richard Feynman. 

Feynman, the mid-twentieth-century’s greatest theoretical 
physicist, came up with the idea of quantum simulation in 1981 when 
he was asked to deliver a keynote speech at the Massachusetts Insti- 
tute of Technology (MIT) in Cambridge’. He decided to talk about 
how physics might be simulated with computers and got straight to 
the core of the problem: computers run on certainty, but at a fun- 
damental level, nature deals in probability. According to the laws of 
quantum mechanics, he knew, particles very rarely exist in one state 
or another, but instead live in a ‘superpositior of two states at once. 
When observed, the paradox resolves itself according to the laws of 
statistics. For example, an electron’s ‘spin’ may orient itself in one 
direction half the time, and in the other direction for the other half. 

It is not hard to program a normal computer to model the 
probabilistic behaviour of that one electron, said Feynman. But 
particles do not live in isolation, and in quantum systems their prob- 
abilities are linked, or ‘correlated’ These correlations mean that every 
combination of particle states must be computed separately, and this 
creates an exponential rise in complexity. A system with three elec- 
trons has eight possible configurations, with eight probabilities to 
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QUANTUM BOARD GAMES 


The set-ups of quantum simulators are different, but the concept is the same: first take atoms, ions or electrons, cool them to cryogenic 
temperatures and arrange them in an orderly grid. Then tune the interactions on the grid to mimic a more complex material. 


COLD ATOMS 


Rubidium atoms are held in place by criss-crossed 
laser beams, which can also be used to tweak 
individual particles. A single pair of lasers holds the 
atoms in a one-dimensional column (top), whereas two 
pairs hold them in a grid (bottom). Some excitations in 
the grid system behave like the Higgs particle. 


TRAPPED IONS 


A combination of electric and magnetic fields 
trap charged, ionized atoms in an orderly grid. 
The ions wiggle and rotate in a way that 
mimics the interactions of quantum 
magnetism — a phenomenon that can’t be 
simulated in classical systems. 
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SUPERCONDUCTING LOOPS 


A quantized loop of current can flow clockwise, 
anticlockwise or in a superposition of both in a 
superconducting circuit (top). An array of such loops 
(bottom) can be manipulated to simulate various 
quantum systems — and perhaps even biological 
processes such as photosynthesis. 


Cold atomic gas 


ui 
iit 


lons 


» 


Optical 
lattice 


Rubidium 


compute; 300 electrons create as many configurations as there are 
atoms in the known Universe. 

Feynman spent most of his lecture trying to find a way out of this 
conundrum. It is not easy using ordinary computers, he concluded, 
but there is another possibility: build a computer that thinks in terms 
of probabilities. This quantum imitator, as he called it, would looka 
lot like whatever system you were trying to model. It wouldn't need to 
crunch every outcome, but instead would simply recreate the range of 
probabilities. Rather than delivering one solution, the imitator would 
deliver many, and the likelihood of each answer would create a proba- 
bilistic picture ofhow the complex system behaves. Feynman didn't do 
the maths, but he did conclude that 
almost any quantum system “can be 
simulated in every way, apparently, 
with little latticeworks of spins and 
other things”. 

At the time of Feynman’s talk, 
the little lattices of which he spoke 
didn't exist. Quantum systems are 
extremely fragile, in the sense that 
almost any interaction with the out- 
side world will destroy the delicate 
correlations. It has taken 30 years to develop the technology required to 
keep the particles isolated enough to finish the simulation unimpeded, 
yet interactive enough to let physicists extract the answer. But there 
are now several options. Bloch’s group uses neutral atoms, other teams 
are combining electric and magnetic fields with lasers to trap ions of 
lighter atoms, such as beryllium. A third technique involves controlling 
eddies of current inside superconducting microcircuits, and a fourth 
uses quantum particles of light — photons — moving through micro- 
scopic waveguides (see ‘Quantum board games’). 

All these techniques are rapidly increasing in their capabilities. 
In April, a group led by John Bollinger at the National Institute 
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of Standards and Technology in Boulder, Colorado, unveiled a 
two-dimensional system of hundreds of trapped ions that could 
simulate a form of quantum magnetism’. The simulator seems to 
work well for weak fields of the sort that can already be modelled on 
classical computers, says Bollinger. Now, with some modifications, 
he hopes to simulate strong magnetic fields, which are beyond the 
reach of even the most powerful supercomputers. 

Bloch, meanwhile, is considering applications beyond the Higgs 
for a neutral-atom simulator. For example, the rubidium atoms in 
his lattice might be used to model a complex class of materials called 
high-temperature superconductors. These ‘high-T,’ materials can 
conduct electrons with no resist- 
ance at temperatures much higher 
than conventional superconductors 
can — but for decades nobody has 
been able to understand why. Theo- 
rists have developed a number of 
competing models to explain the 
behaviour, but haven't been able 
to test them: the electrons in the 
superconductors are just too dif- 
ficult to isolate and study. So Bloch 
wants to use atoms as surrogates. By changing the intensity of the 
criss-crossing laser beams, atoms can be made to tunnel from one 
point in the lattice to another in a way that mimics the motion of 
electrons through the atomic lattice of a high-T, material. At least 
some theories of high-T. superconductivity should be checkable with 
Bloch’s set-up. 

Quantum simulators might even be able to model non-quantum 
problems, such as protein folding, that still require huge amounts of 
computing power to decipher. A group at the Canadian company 
D-Wave Systems in Burnaby and at Harvard University in Cambridge, 
Massachusetts, recently did just that by mathematically mapping 
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the folding problem onto a quantum system of 128 loops of current 
spinning ona superconducting chip’. Each loop could spin clockwise, 
anticlockwise or in a superposition of both directions simultaneously. 
The performance of the system wasn't great; in one of its protein- 
folding problems, it found the correct, experimentally determined, 
answer just 13 out of 10,000 times. Still, says Alan Aspuru-Guzik, a 
theoretical chemist from Harvard and co-author of the paper, “it’s 
remarkable to me that it was possible to do it” at all. 


GOAL CHANGE 

Despite all the technical progress, however, the existing simulators are 
at best a limited approximation of Feynman's original vision — a fully 
fledged quantum computer that is ‘universal; or able to execute any 
quantum algorithm and simulate any conceivable quantum system. 
Researchers have been exploring the potential applications of such a 
device ever since Feynman described it. Arguably the most important 
one came in 1994, when mathema- 
tician Peter Shor, now at MIT, laid 
out an algorithm that would allowa 
quantum computer to function as a 
powerful code-breaking machine’. 
Other quantum algorithms have 
followed, drawing many scientists 
(and several intelligence services) 
into the quest for quantum com- 
puting and sparking widespread 
efforts to create such a machine. 

Yet building a powerful, 
universal quantum computer has 
proven to bea tough task. A true Feynman computer would be able 
to control thousands or millions of atoms at once, but most of the 
current systems face a trade-off between size and control. Bloch, for 
example, can hold as many as hundreds of thousands of atoms in his 
laser lattice, but he can’t then set their quantum states individually. 
Other researchers have more control over individual atoms, but their 
systems, which use trapped ions of beryllium, can manage only a 
handful of atoms with exquisite precision. On top of this comes the 
omnipresent problem of disruptions from the outside world, which 
ruin delicate quantum states: even the tiniest bump will create a 
computational error. 

With current systems so far from the ideal, quantum simulators 
have come to be seen as less of a stepping stone, and more ofa goal in 
their own right. Simulators do not need to be as large as computers, 
and, crucially, because the answer is encoded as an average across all 
their atoms, they are believed to be tolerant of the outside disruptions. 
“In a quantum computer you have to make sure that no particle makes 
a mistake,” says Ignacio Cirac, a theorist at the Max Planck Institute 
for Quantum Optics. “In a quantum simulation, if you have 100 par- 
ticles and one of them is wrong, then 99 are still right.” 

Some see parallels to the middle of the last century, when scientists 
such as Vannevar Bush were experimenting with ‘analog’ comput- 
ers made from resistors and capacitors. The machines were tailored 
to specific problems or to a class of problems, and could perform a 
simple set of operations on an input signal. Some of the devices could 
even perform mathematical calculations. In retrospect, they seem 
puny compared with digital computers, which use programmable 
combinations of transistors to perform practically any program. But 
they were fast, robust and valuable for applications that matched their 
architecture, says Seth Lloyd, a theoretical physicist and engineer at 
MIT. They were particularly good at controlling machinery, for exam- 
ple. “All the control circuits in the Saturn moon rocket were analog,” 
Lloyd says. 

Like analog computers, quantum simulators are closely tied to 
their constituent parts, and are less flexible than a true quantum com- 
puter. But Lloyd thinks that they might yet find their ‘Moon shot’ in 
problems of quantum complexity. For example, as microprocessors 
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shrink and new materials are engineered at a molecular level, 
quantum effects become more and more important. That, in turn, 
will lead to a dramatically growing need for quantum modelling that 
allows designers to understand and predict the materials’ behaviour. 
At least some of those needs are going to be met by quantum simula- 
tors, Lloyd predicts. “What seems to be happening is that quantum 
simulators work on a variety of special cases,” he says, “and the num- 
ber of cases seems to be growing rather rapidly.” 

Aspuru-Guzik has one such process in mind: photosynthesis. 
When light strikes a leaf, it creates a pair of negative and positive 
charges that travel long distances to reaction centres, where they are 
used to make energy for the plant. The charge pairs may travel accord- 
ing to the rules of quantum mechanics: some researchers think that 
the collective wavefunction of the pairs spreads out across the light- 
absorbing chromophore molecules inside the leaf, allowing the pairs 
to move more efficiently than they would classically (see Nature 474, 
272-274; 2011). 

Aspuru-Guzik and others think 
that a simulator could help them to 
pin down exactly how this happens. 
Photosynthesis is what Aspuru- 
Guzik calls a “dirty quantum sys- 

em’ — that is, it contains both 
quantum and classical elements. 
A little matrix of superconducting 
current loops might be perfect for 
modelling it, he argues, because the 
loops, too, are subject to noise from 
the outside world. It still wouldn't 
be easy, however: Aspuru-Guzik estimates that something such as 
photosynthesis would require hundreds of quantum bits to simulate, 
and those systems, he predicts, are at least a decade away. 

The ambitions of the scientists developing quantum simulators are 
considerably more modest. Most are starting their systems out on 
models that can be calculated with conventional supercomputers to 
prove that their simulators produce reliable results. Gradually, they 
plan to push their atoms, current loops or other little units to the point 
at which the supercomputers can no longer cope. 

At that point, “the model that we're able to implement might not 
even correspond to a real material, but in a sense, who cares?”, says 
Chris Monroe, a physicist at the University of Maryland in College 
Park. Even if they don't behave like a superconductor or a Higgs par- 
ticle, the new systems may still be able to tell researchers a thing or 
two that their older machines can't. Eventually, Monroe and others 
believe that simulators will be tailored to model different things. Cold 
atoms, for example, might work best on superconductors, whereas 
ions could handle magnetism. Of course, there will still be quantum 
systems that are too tough for any set-up to tackle. 

It may be a vision considerably less flashy than Feynman's universal 
quantum machine, yet within the physics community, quantum simu- 
lators are getting more attention than ever before. “Many physicists 
who sort of pooh-poohed the idea of quantum computing, especially 
ten years ago or so, they're now sort of embracing this,” says Monroe. 
The systems may be less ambitious, but that may make them more 
achievable. 

Lloyd puts it another way. “If life doles you quantum lemons, let’s 
make quantum lemonade,’ he says. Simulators may not be as sweet 
as quantum computers, but “as long as the lemonade is tasty and 
refreshing, I think that’s fine”. = 


Geoff Brumfiel is a senior reporter at Nature. 


1. Endres, M. et al. Nature 487, 454-458 (2012). 

2. Feynman, R. P. Int. J. Theor. Phys. 21, 467-488 (1982). 

3. Britton, J. W. et al. Nature 484, 489-492 (2012). 

4. Perdomo-Ortiz, A., Dickson, N., Drew-Brook, M., Rose, R. & Aspuru-Guzik, A. 
Sci. Rep. 2,571 (2012). 

5. Shor, P. W. SIAM J. Comput. 26, 1484-1509 (1997). 


© 2012 Macmillan Publishers Limited. All rights reserved 


COM 


Acall for more Conservation in Italy Extrasolar Edward Donnall 
rigorous research into health 40 years on from UNESCO planets, sci-fiand Kim Thomas, bone-marrow 
impact of GM foods p.327 heritage list p.328 Stanley Robinson p.330 pioneer, remembered p.334 


ILLUSTRATION BY ANDREW RAE 


ca M/, Md) 


Secure the Internet 


Software engineers must close the loophole used to intercept 
online communications, say Ben Laurie and Cory Doctorow. 


discovered on the Internet. To any user it 
looked authentic. The software's crypto- 
graphic certificates, which securely verify 
the authenticity and integrity of Internet 
connections, bore an authorized signature. 
Internet users who thought they were apply- 
ing a legitimate patch unwittingly turned 
their computers into spies. An unknown 
master had access to all of their data. 
The keys used to sign the certificates had 
been stolen from a ‘certificate authority’ 
(CA), a trusted body (in this case, the 


I: 2011, a fake Adobe Flash updater was 


Malaysian Agricultural Research and 
Development Institute) whose encrypted 
signature on a website or piece of software 
tells a browser program that the destination 
is bona fide. Until the breach was found and 
the certificate revoked, the keys could be 
used to impersonate virtually any site on the 
Internet. 

Fake certificates are used by hackers and 
governments to harvest online commun- 
ications. In 2011, for example, a hacker 
based in Iran stole the signing keys from 
DigiNotar, a Dutch CA that declared 


bankruptcy soon afterwards. The keys were 
used to impersonate sites such as Facebook 
and Gmail in Iranian dissidents’ browsers, 
allowing all of their messages to be read. 
Certificates allow the web to work. They 
secure transactions and allow users to enter 
credit-card numbers, share data across 
networks or chat in private forums. Without 
certificates, hackers could easily stop, 
corrupt or eavesdrop on these exchanges. 
But certificates are in trouble. As more 
authorizing bodies are added to browsers’ 
lists of trusted CAs, and as governments, 
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> hackers and unscrupulous insiders 
weaken the Internet’s security system, it 
is becoming virtually impossible to know 
whether a connection is legitimate. 

Many Internet technicians, ourselves 
included, agree that it is time to fix the 
problem. What remains is the substantial 
hurdle of reaching consensus about how. 

The international Internet Engineering 
Task Force is adding registration informa- 
tion to the (already overburdened) domain 
name system (DNS). But websites can still be 
taken over. The Electronic Frontier Founda- 
tion (where C.D. is a fellow), based in San 
Francisco, California, has proposed a cryp- 
tographic protocol called Sovereign Keys 
(SK) that would 


makeitimpossible = « Itis becoming 
for a third party to virtually 
oe als any impossible to 
website. 
A third effort is know whether 
aconnectionis 


under way, led by 
a team (including 
B.L.) at Google, 
based in Mountain View, California. 
This protocol — called Certificate Trans- 
parency (CT) — is similar to SK, but it 
includes an independent cross-checking 
system. Release dates have not been set for 
either protocol, but CT has the potential 
to be rolled out sooner, through regular 
software updates for Google’s web browser, 
Chrome. We see it as a stepping stone to a 
more ambitious system, such as SK. 

We call on browser vendors to support 
a shift to a more secure system. There are 
economic barriers — no one is likely to make 
money from shoring up the Internet. But the 
risks of ignoring this security loophole are 
too great. 


legitimate.” 


CROSS-TALK 

Before your browser connects to a website, 
it asks your local network’s DNS server for 
the numeric address corresponding to the 
website’s domain name. (For example, one 
of the addresses for www.facebook.com 
is 66.220.149.11.) But DNS is not secure 
— its communications with browsers are 
unscrambled, and they are easy to intercept. 
Anyone sharing your network can steal 
your credit-card information or passwords. 

To scramble messages and keep them 
private, you need encryption. If a browser 
has received a cryptographic certificate 
signed by a CA, its address bar shows a 
key or padlock icon. All browsers have 
pre-installed lists of trusted CAs against 
which to check certificates. 

CAs do a lot of due diligence before 
issuing a certificate, but they are fallible. 
Cryptographic methods can be used to 
spot forgery and tampering, but not to 
distinguish real certificates issued by diligent 
CAs from those issued by mistake or by a 


CA that has been conned or taken over. 
The proliferation of CAs is putting the 
entire Internet at risk. Governments are not 
responding to the problem — indeed, some 
policy-makers have shown a remarkable 
willingness to undermine online security 
for law-enforcement reasons. India’s gov- 
ernment, for example, is seeking weaker 
security for Skype and BlackBerry mobile 
devices’. In 2011, US lawmakers proposed 
the Stop Online Piracy Act, which would 
require DNS providers to return false results 
when users try to connect to sites accused of 
facilitating copyright infringement’. 
Luckily, software engineers are in a 
position to fix the certificate loophole. 


CERTIFICATE TRANSPARENCY 

CT and SK rely on a type of record that 
uses cryptographic methods to prove that 
none of its past entries has been erased 
— an ‘untrusted, verifiable, append-only 
log’. This log is based on a mathematical 
principle called a Merkle tree — a hierarchy 
of linked items, or ‘leaves’ 

In CT, each leaf is a certificate. For a par- 
ticular node in the tree, a value can be gener- 
ated in a few steps on the basis of the values 
of other nodes. For a tree with, say, a million 
leaves, a verifier would have to track only 20 
nodes to confirm any particular leaf. Even 
a tiny alteration throws off the calculations 
entirely. With Merkle trees it is possible to 
prove efficiently that a particular leaf is in 
the tree without revealing the contents of 
the other leaves. It is also impossible to fake 
such a proof for a leaf that is not in the tree. 

Merkle-tree logs are stored on a small 
number of computers, or log servers. Every 
time a CA generates a new certificate, it 
sends a copy to all the log servers, which 
then return a cryptographically signed 
proof that the certificate has been added 
to the log. Browsers could be pre-config- 
ured with a list of verified log servers (in 
addition to the list of CAs now installed). 

Periodically — perhaps hourly — a 
number of ‘monitor’ servers contact the 
log servers and ask for a list of all the new 
certificates for which they have issued 
proofs. These monitors — operated by 
companies, banks, individuals and service 
providers — would discover any unauthor- 
ized certificates, just as credit reports alert 
people to cards or loans issued falsely in 
their names. 

This process works only if the log servers 
are honest; here, auditor servers come in. 
Every so often, a browser sends all the proofs 
it has received to a number of auditors — 
anyone may act as one, because the logs 
are public. If a proof has been signed by 
a log server but does not appear in its 
log, the auditor knows that something 
is wrong. Within an hour of committing 
their first transgressions, rogue CAs and 
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log servers could be detected and removed 
from browsers lists. 

Even with modest uptake, CT will begin 
cleaning up the Internet immediately on 
roll-out. Blocking will improve as more 
organizations, browser vendors and users 
participate. Initially, browsers that adopt 
CT will not be able to block connections 
for which no proofs are offered. After a 
year, Chrome will be updated to warn users 
before establishing a secure connection 
without a proof. Later, it will not connect 
to any site without one. We hope that other 
browsers will follow a similar path. 

Also based on Merkle-tree logs, SK is 
a more theoretical approach. Instead of 
requiring CAs and domain registrars, SK 
issues a private key for each website to only 
one holder. No one else may use that key, 
so misuse by a third party is not a prob- 
lem. Anything unverified is blocked, so SK 
would foil attempts by governments to use 
domain seizures to censor content that they 
find to be objectionable. 


LONG-TERM GAIN 

Through systems such as CT and SK, soft- 
ware engineers can and should solve the 
certificate problem. After all, the Internet 
is international and independent; govern- 
ments cannot mandate a solution. 

There are some economic barriers to 
improving security, but it isa worthwhile 
investment. CAs have no short-term incen- 
tive to support these measures, but the CA 
network offers the best avenue for rolling 
out steps such as CT. Similarly, someone 
must run the log servers, even though doing 
so will not lead to direct economic gain. But 
the long-term stability and security of the 
Internet is good for business. In line with 
this view, Google will run some log servers 
— but others should as well, so that we can 
avoid putting all of our eggs in one basket. 
Browser vendors should also commit to 
supporting CT. 

History tells us that those who seek new 
avenues of attack will eventually find them. 
But this troubling breach must be closed 
down now. = 
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Bring more rigour to GM research 


The latest furore over GM food highlights the need for good-quality research 
on highly sensitive topics, says Francois Houllier. 


his autumn, a team of French 

researchers published results showing 

that rats fed with genetically modified 
(GM) maize (corn) died younger and showed 
more organ damage and tumours than usual. 
The team also observed similar effects in rats 
exposed to a combination of the GM maize 
and the herbicide it is designed to tolerate, 
and to the herbicide alone’. Not surprisingly, 
these results sparked debate among the pub- 
lic and many in the scientific community. 

The public concern is easy to compre- 
hend. The images of tumour-ridden rodents 
included in the study tapped into existing 
controversies over the safety of GM crops 
going back several decades, especially in 
Europe. Scientific reactions were intense 
for the opposite reason: other research on 
exposure to GM foods has not shown such 
pathological patterns’. 

Since the paper was published, members of 
the scientific community have found weak- 
nesses in the analysis, which, in their view, 
call the conclusions into question. But some 
damage may have been done: an opinion 
poll a few days after the paper's publication 
— although not specifically mentioning the 
study itself — showed that 79% of French 
people were worried about the possible 
presence of GM organisms in their food, 
compared with 65% in 2011 (ref. 3). 

In my opinion, this episode highlights a 
major issue: there is a need for extra rigour 
in research whenever it tackles sensitive top- 
ics such as GM crops and food. Until science 
moves to the forefront, I believe the debate 
risks remaining mired in confusion and 
misinformation, no matter what improve- 
ments are made in public engagement. 

Reactions to this latest GM study were 
reinforced by an unusual communication 
campaign. The authors informed a few jour- 
nalists about the paper in advance, and asked 
them to sign an agreement saying that they 
would not interview outside experts until 
after the story had appeared in a French 
weekly magazine, Le Nouvel Observateur. 
In the following weeks, two books and a 
documentary based on the conclusions 
of the study were released. The effect was 
immediate: in my view, there was an initial 
wave of one-sided alarming news reports 
and increased distrust towards “the system”. 

The French government and the European 
Commission immediately asked national and 
European food-safety agencies to review the 
publication. Most have now released prelimi- 
nary or final reports pointing out weaknesses. 


Field trials of a GM grapevine rootstock were 
destroyed by activists in France in 2010. 


These include a lack of relevant statistics — 
for example, on mortality and tumour inci- 
dence — resulting from the use of too few 
animals per group for long-term studies and 
a lack of plausible biological mechanisms for 
understanding the alleged effects. In my view, 
the paper seems to have failed to convince 
many in the scientific community, despite 
asking legitimate questions about long-term 
toxicity tests and the effects of a herbicide. 
Study author Gilles-Eric Séralini, of the 
University of Caen, has agreed that more 
animals would render the study more robust, 
but says that his findings are supported by 
many observations; he has also accused many 
detractors of conflicts of interest. 

How do we address the questions about the 
impact of GM crops, and how do we prevent 
this kind of negative reaction? First, I believe 
that we need to publicly fund more risk- 
benefit analyses of GM crops. We also need 
more interdisciplinary studies of GM foods, 
especially on health impacts in animals and 
humans. A review’ identified 24 papers fea- 
turing trials of feedstuffs containing various 
GM crops, in which the trials lasted more 
than 90 days or were done in more than one 
generation. By contrast, more studies have 
charted the environmental impacts of GM 
crops, including long-term, large-scale studies 
and meta-analyses (see, for example, refs 4,5). 

Research into GM crops can be difficult. 
For example, at the French National Insti- 
tute for Agricultural Research (INRA) in 
2005, we launched a programme to test the 
environmental impact of a GM grapevine 
rootstock that was supposed to be resistant 
to grapevine fanleaf virus, which causes large 
economic losses. The project was funded only 
by public money; it did not aim to develop a 
commercial variety. There was a public con- 
sultation about it, moderated by specialist 


social scientists®’, and stakeholders were 
transparently informed. Nonetheless, activ- 
ists destroyed the crop in August 2010. 

Second, research must always follow 
proper academic standards. In my opinion, 
any breach in the rigour and traceability 
of the scientific workflow — stating the 
research question and hypothesis; designing 
adequate experiments; using relevant data 
analysis and modelling techniques; allowing 
outside experts to comment on the results — 
could, I fear, lead to a lack of trust. 

Publishing a paper is not the end of 
the story. New data and results should be 
tested by the scientific community until a 
convergent corpus of evidence is reached by 
independent teams. I believe that publiciz- 
ing and sharing raw data and disseminating 
new methods are thus extra crucial stages. 
The more unexpected the results, the more 
rigorous this workflow should be. 

Third, the distinctions between scientific 
research, risk assessment and risk manage- 
ment must be clearly articulated. This is vital 
for public trust in the long term. The GM 
maize case has generated the feeling that 
research organizations should do risk-assess- 
ment trials. I disagree: they should focus on 
elaborating and testing new methods — such 
as how we can use metabolomics to get early 
predictors of metabolic impacts — and on 
dissecting underlying biological mechanisms. 
In my view, risk assessment should remain 
within the province of dedicated agencies 
using specific guidelines and impartial pro- 
cedures, albeit informed by the best research, 
as happened with the chemical bisphenol A*”. 

As scientists, we must champion the 
multiple concerns of society, even when they 
make a contradictory call for more innovation 
as well as more precaution. = 
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The Colosseum in Rome: a much-needed €25-million conservation project is set to begin next month. 


CONSERVATION 


Shoring up 
the wonders 


Forty years on from UNESCO’s world heritage convention, 
Alison Abbott contemplates the state of Italy’s vast legacy. 


ven in the brightest sunshine, Mount 
hae casts a threatening shadow 

over Naples in southern Italy. Resi- 
dents live in fear of the volcano, whose 
murderous eruption in AD 79 propelled lava 
and ash over surrounding towns, including 
Pompeii and Herculaneum, burying them. 

For centuries, the ancient towns remained 
safely sealed from the elements; they were 
rediscovered only in the eighteenth century. 
Archaeological excavations since then have 
revealed much about life in Roman times, but 
Pompeii in particular dominates the public’s 
imagination. The 66-hectare site, two-thirds 
of which has been excavated, receives more 
than 2 million visitors a year. Many Neapoli- 
tans make their living thanks to the tourist 
industry created by the catastrophe. 

But a new shadow has fallen on the sites. 
The collapse of some structures during the 
past few years — including Pompeii’s Schola 
Armaturarum or ‘House of the Gladiators’ 
in November 2010 — has raised questions 


about whether Italy is taking good enough 
care of its considerable cultural heritage. 
Concerns have been inflamed by a well- 
publicized series of calamities, small and 
large, at several other sites in Italy, including 
stones falling from the walls of the Colos- 
seum a year ago. 

Italy has the largest number of entries 
of any country on the World Heritage List, 
which was created on 16 November 1972 
under the UNESCO Convention Concern- 
ing the Protection of the World Cultural and 
Natural Heritage. Along with Italy’s place on 
the list comes moral pressure to safeguard 
its heritage — artefacts, artworks and archi- 
tectures from the Etruscan and Roman peri- 
ods, through the Renaissance and up to the 
twentieth-century dictatorship of Mussolini, 
which put an end to Italian glory. Minor 
amphitheatres in remote towns like Cassino 
and specialized scientific collections such as 
the University of Pavia'’s eccentric hoard of 
pathological specimens are considered no 
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less important than better-known items. 

It is often forgotten just how much Italy is 
doing right where its heritage is concerned. 
Many important sites and artworks are in 
fine shape — for example, the painstakingly 
restored Last Supper by Leonardo da Vinci 
in Milan. Between 1977 and 1999, under 
the guidance of Pinin Brambilla Barcilon, 
conservators used techniques such as chemi- 
cal analysis of different layers of the fresco 
in microscopic core samples and infrared 
reflectoscopy to see below the surface of the 
fresco without harming it. Indeed, Italy has 
several world-class conservation and resto- 
ration institutes, including the International 
Centre for the Study of the Preservation and 
Restoration of Cultural Property in Rome and 
the Opificio delle Pietre Dure in Florence. 

Yet political support for culture in Italy 
dwindled from the 1980s onwards, and 
funds continue to shrink alarmingly. Retir- 
ing staff working at cultural heritage sites 
are not replaced. The proportion of the state 
budget dedicated to culture shrank from 
0.39% in 2000 (more than €2 billion, or 
US$2.6 billion) to 0.19% in 2011 (less than 
€1.5 billion). 

The consequences are evident at the 
Vesuvius archaeological sites. Any city will 
quickly deteriorate if its roofs are not fixed 
and its drains not cleared. Over the decades, 
water from below and above has caused salts 
to leach through walls, destabilizing them, 
damaging mosaics and destroying frescos. 

The problems are as much managerial 
as financial. Pompeii acquired substantial 
subsidies through the European Union 
(EU) Structural Funds in the 1980s and 
1990s. But instead of using those to conserve 
the exposed remains, the superintendency 
embarked on glamorous new excavation 
work to impress politicians. This went so 
badly that, at one point, the EU suspended 
payment. 

In 1997, just 16 out of the Pompeii super- 
intendency’s 711 staff were archaeologists, 
architects and art historians; in the era of 
computers, 34 were typists. Successive gov- 
ernments went on to shamelessly ignore 
Pompeii’s autonomy. The 2006 govern- 
ment siphoned off €30 million of Pompeii’s 
income for spending elsewhere. In 2008, 
the government declared a one-year state of 
emergency for the site, later extended by a 
further year. 

Responsibility for all aspects of cultural 
heritage in Italy is centralized within the 
ministry of culture, whose regional offices, 
called superintendencies, mediate local 
needs and prevent unauthorized activities. 
This system protects 
heritage from crass 
development, but can 
be damagingly slow in 
operation. Moreover, 
staff at all sites — from 
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archeologists to ticket collectors — are gov- 
ernment employees with jobs for life. The 
inflexibilities make long-term planning 
almost impossible. 

During the past decade or so, successive 
governments have experimented with new 
approaches to funding conservation, with 
some clear successes. The Egyptian Museum 
in Turin has, since 2005, been managed by 
a private foundation. This has renovated 
and modernized the museum, to interna- 
tional acclaim. And shoe magnate Diego 
Della Valle is paying €25 million for urgently 
needed conservation work on Rome's Colos- 
seum that is being directed by the ministry. 
In return, he gets exclusive rights to use the 
image of the edifice to promote his products 
for 15 years. Alarmed academics have tried 
to equate such activities with privatization. 
But the heritage itself remains firmly in the 
possession of the state, which retains full 
power to control conservation or restora- 
tion projects. 

Now the Pompeii superintendency has a 
further €105 million of EU structural funds 
to spend on securing its site, efficiently and 
effectively, under stern oversight — and 
within just three years. This will be a chal- 
lenge, although the project acquired a fur- 
ther 20 or so architects and archaeologists 
this year. 

Herculaneum, fortunately, won the support 
of philanthropist David W. Packard, son of the 
co-founder of the Hewlett-Packard infor- 
mation-technology company. His Packard 
Humanities Institute in Los Altos, California, 
has been running the Herculaneum Conser- 
vation Project in partnership with the super- 
intendency and the British School at Rome 
for the past 11 years. This international, inter- 
disciplinary team of archaeologists, architects 
and conservationists do unglamorous practi- 
cal conservation work. This could be mend- 
ing the ancient drainage networks, repairing 
roof coverings or driving out the pigeons 
whose voluminous, acidic excreta destroy 
frescos. The work is mostly low-tech — for 
example, the best solution they’ve found for 
the pigeons is to encourage falconers to visit 
the site regularly. 

The Herculaneum project has inspired at 
least one other consortium of foreign scien- 
tists to bid to help to conserve and restore 
some frescoed houses in Pompeii, working 
in partnership with the Italians. 

Such respectful international support for 
Italy’s cultural heritage is fundamental. But 
the country will have to help itself by relax- 
ing outdated labour laws and modernizing 
management of its cultural heritage system- 
atically. Italy can’t do much about Vesuvius’ 
shadow. It can do a lot about the political 
shadows it casts on itself. m SEE EDITORIAL P.302 


Alison Abbott is Nature’ senior European 
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Books in brief 


Planet Without Apes 

Craig B. Stanford HARVARD UNIV. PRESS 272 pp. $25.95 (2012) 

Will electronic gadgetry bring down the great apes? The link may 
seem surreal, but in this study of the plight of gorillas, chimpanzees, 
orangutans and bonobos, primatologist Craig Stanford reveals 

how mining coltan, a mineral used in electronics, destroys primate 
habitats and fuels the illegal bushmeat trade. In his wide-ranging 
call for action, Stanford — co-director of the Jane Goodall Research 
Center in Los Angeles, California — lays out the critical threats, 
arguing that humanity’s closest cousins are viewed as savage ‘others’ 
and subjected to a genocidal urge last seen in the colonial era. 


Jefferson’s Shadow: The Story of His Science 

Keith Thomson YALE UNIV. PRESS 322 pp. $30 (2012) 

Architect, philosopher, critic of slavery, slave-owner: the 
contradictions of American ‘founding father’ Thomas Jefferson are 
well known. That he was a scientist is not. Natural historian Keith 
Thomson redresses the balance in this finely wrought biography. 
Immersed in the work of Isaac Newton and Francis Bacon, Jefferson 
was arguably the most clued-up American naturalist of his time. 
This scintillating intellectual traced climate fluctuations, delighted in 
data tables, pored over fossils and helped to introduce the nation to 
palaeontology, geography, scientific archaeology and climatology. 


A Single Sky: How an International Community Forged the Science 
of Radio Astronomy 

David PD. Munns MIT PREss 264 pp. $34 (2012) 

During the past 60 years, radio technology has transformed 
astronomy from a venerable practice reliant on visible light to an 
astounding new window on the cosmos. As historian David Munns 
reveals, it was all down to an international network of scientists who 
defied the rivalries of the cold war to ensure collaborative exploration 
of a ‘single sky’. This remarkable science, forged by American, 
British, Australian and Dutch radio astronomers, ultimately led to 
the mapping of the Milky Way. 


Newton and the Origin of Civilization 

Jed Z. Buchwald and Mordechai Feingold PRINCETON UNIV. PRESS 
544 pp. £34.95, $49.50 (2012) 

Isaac Newton spent most of his 84 years in pursuit of knowledge 
— mathematical to metaphysical. In this tome, historians Jed 
Buchwald and Mordechai Feingold unveil yet another strand: 
historical chronology. When Newton’s Chronology of Ancient 
Kingdoms Amended was published in 1728, it drew fire for its 
dramatic revisions to timelines of civilizations past. Yet Newton, 
the authors show, approached the study — using astronomy and 
population dynamics — with the same rigour he brought to science. 


Chasing Doctor Dolittle: Learning the Language of Animals 


ane Con Slobodchikoff ST. MARTIN’S PRESS 320 pp. $25.99 (2012) 
HASING An alarmed prairie dog can recognize and communicate the 
x colour, shape, size and species of a predator. So says biologist Con 
CTOR Slobodchikoff, who — after 25 years of studying these hefty ground 
D irr squirrels of the US grasslands — posits that animals have 
{ LE | language. He bases his theory on a physiological and structural 


system not unlike the skeletal system that has parallels in humans 
and other vertebrates (think of human vocal chords and the avian 
double syrinx). 
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| COMMENT | BOOKS & ARTS 


Curtains for space opera? 


Since July, astronomers have killed off one trope of science fiction and given fresh life 
to another. Leigh Phillips gets Mars Trilogy author Kim Stanley Robinson’s reaction. 


arnard’s star is a star indeed. A member 
B of the second closest star system to the 
Sun at 6 light years (1.84 parsecs) away, 
it pops up all over twentieth-century science 
fiction, from classic comics to Asimov. 
Dutch-American astronomer Peter van de 
Kamp made the first modern claim to have 
spotted an exoplanet there in 1963, having 
studied the star since 1938. He thought that 
he had discovered wobbles in the position of 
Barnard that indicated a Jupiter-class planet 
in orbit around it. In 1969, van de Kamp 
revised his findings, positing two planets 
— one slightly bigger than Jupiter and one 
slightly smaller. But it wasn't long before other 
astronomers challenged the claims, suggest- 
ing that van de Kamps ‘discovery’ was merely 


an artefact of upgrade work at his observatory. 

Since van de Kamp’s time, the Barnard 
‘system’ has been a staple of sci-fi, from 
short stories and novels, to films and televi- 
sion series. In Douglas Adams’ The Hitch- 
hiker’s Guide to the Galaxy series and Arthur 
C. Clarke’s The Garden of Rama (Bantam, 
1991), it is a way station for interstellar 
travellers. Michael Moorcock uses an imag- 
ined planet orbiting the star as the site of 
a refugee camp for humans fleeing social 
breakdown on Earth. For Isaac Asimov, 
a Barnard-system 
planet is home to 
invertebrate marine 
animals. In a series of 
comic-book strips in 


> NATURE.COM 
See Nature's science 
fiction special at: 
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the 1970s, Will Eisner sited humankind’s 
first contact with an extraterrestrial civi- 
lization on a planet in the system. And in 
the short-lived Battlestar Galactica spin-off 
series Galactica 1980, the dastardly Cylons 
are believed to be hiding there. 

Recently, the status of this sci-fi staple 
itself wobbled. In August, a survey by a team 
of eight astronomers, led by Jieun Choi of 
the University of California, Berkeley, and 
covering 25 years’ worth of measurements, 
concluded that Barnard’s star does not have 
any planets — Earth-size or otherwise. 

Two months later, astronomers had better 
news for the sci-fi cognoscenti. On 17 Octo- 
ber, Xavier Dumusque at the University of 
Geneva in Switzerland and his team reported 


INGA NIELSEN 


in Nature that Alpha Centauri B, a mem- 
ber of our closest star system, just 4.3 light 
years away, has an Earth-sized planet orbit- 
ing — albeit with a tight, sun-hugging 
‘year’ of just 3.236 days, far from the pre- 
sumed habitable zone (X. Dumusque et al. 
Nature 491, 207-211; 2012). 

This was sure to resonate with read- 
ers of Stansilaw Lem, Robert Silverberg, 
Philip K. Dick and, again, Asimov and 
Clarke, who all made use of the Alpha 
Centauri system in their fiction. It also 
appeared in the television series Buck 
Rogers in the 25th Century, Doctor Who 
and Star Trek. Indeed, Zefram Cochrane, 
the Star Trek character who ‘invented’ the 
warp drive, lived there. 

So what do these two scientific devel- 
opments mean for science fiction? Kim 
Stanley Robinson, author of the bestsell- 
ing Mars Trilogy, takes a radical view. 
He suggests that we get over the idea of 
interstellar travel altogether: a probe 
would take 28,000 years to get to Alpha 
Centauri. “We can’t go fast enough to get 
to any of these places,” he says. 

Barnard’s star was once “the place 
for nearby space’, Robinson says, as his 
novel Icehenge (Ace, 1984) — in which 
characters build a starship headed for 
it — attests. Now that researchers have 
identified some 840 exoplanets, and 
NASA’s three-year-old Kepler space tel- 
escope has spotted 2,320 candidate plan- 
ets, “there may never again be a single 
default destination’, Robinson continues. 

In his recent book 2312, which ima- 
gines humanity 


three centuries 4 probe would 
fromnow, spread rajee 28 000 
across terra- : 

years to get to 
formed planets, “4 tyha Centauri. 
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moons in our more realistic. 


own Solar Sys- 
tem, Robinson 
writes frankly about the galactic hinter- 
land we inhabit. “The stars exist beyond 
human time, beyond human reach,” says 
the narrator. “We live in the little pearl of 
warmth surrounding our star; outside it 
lies a vastness beyond comprehension. 
The solar system is our one and only 
home?’ 

Of the idea that we are destined to 
go to the stars and inhabit, if not the 
whole Universe, maybe the whole gal- 
axy, Robinson cautions “it’s a fantasy, of 
power, transcendence and a kind of spe- 
cies immortality. We have to get more 
realistic” m= 


Leigh Phillips is an International 
Development Research Centre fellow at 
Nature. 

e-mail: leigh. phillips@gmail.com 


COMPUTER SCIENCE 


BOOKS & ARTS | COMMENT | 


Virtually there 


John Gilbey applauds a call for the digital to join 
the physical, biological and social in science. 


examines the case for computing to enter 

the pantheon of great scientific domains 
alongside the physical, biological and social 
sciences. The centenary year of computing 
pioneer Alan Turing’s birth seems a fitting 
moment to put the idea to the test. 

The study of computing, dated from 
Turing’s work, is only about 80 years old. 
It is variously claimed by engineering, 
physics, mathematics, linguistics and 
psychology — or seen merely as a support- 
ing technology whose academic roots are 
irrelevant. Despite this, computing has 
arguably made more, and deeper, inroads 
into the daily life of humanity during the 
past 50 years than any other academic dis- 
cipline, underlying a series of life-chang- 
ing products. Imagine life today without 
mobile-phone networks, the Internet or 
medical imaging. 

Drawing on his background in artificial 
intelligence, robotics and cognitive architec- 
ture, Rosenbloom leads us through the past, 
present and potential futures of computing 
as an academic discipline and demonstrates 
its linchpin position in a multidisciplinary 
environment. 

He uses a novel ‘relational’ approach, 
unveiling the structures and connected- 
ness across the various subfields of comput- 
ing by looking at types of implementation 
and interaction within and between the 
existing major domains of science. To help 
clarify these relationships, Rosenbloom uses 
metascience expression language, a nota- 
tion that facilitates the representation of the 
multidisciplinary fields and topics within 
science. Metascience expression offers 
both a technical context for Rosenbloom’s 
anecdotal material and a framework within 
which to debate the core tenets of the argu- 
ment. Non-specialists who persevere with 
these sections of the book will benefit 
from a much more 
structured under- 
standing of the 
make-up of the 
computing sciences. 

Rosenbloom fields 
many examples of com- 
puting innovation — includ- 
ing immersive display technologies, 
neurally controlled prosthetics, and 
quasi-autonomous military systems such 
as advanced unmanned aerial vehicles, or 
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drones. These demon- 
strate that traditional 
demarcations between 
real and virtual envi- 
ronments will blur 
over the coming years 
as interfaces between 
human and machine 
are integrated to the 
point of invisibility. 
One example is the 
rapidly expanding 
field of augmented 
reality systems, early 
versions of which are 
already embedded in smartphones and 
tablets. 

Rosenbloom’s reasoned analysis should 
help academia and the wider technical 
community to ensure that this transition is 
managed so as to deliver benefits to human- 
ity in general. Otherwise, that enormous 
and life-changing power will be unfairly 
subjugated by a small minority of interests 
— technical, economic or political. 

The text is permeated with a sense 
of delight in the opportunities offered 
by advances in the computing sciences. 
Rosenbloom offers elegant examples of 
the innovative ways in which computing 
developments and mature research areas 
can have hugely productive synergy — such 
as in surgical robotics and sophisticated 
prosthetic systems. 

On Computing is an unu- 
sual, and welcome, mix of 
conventional academic 

text and personal 

odyssey. Any work cit- 
ing Jane Austen and 

Richard Feynman in 

the same chapter eas- 

ily passes my test for 
an interesting inter- 
disciplinary read. Much 
more, this book offers 
an innovative set of tools 
that could kick-start debate 
and research on the future 
structure of the sciences. = 


7 
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On Computing — 
The Fourth Great 
Scientific Domain 
PAUL S. ROSENBLOOM 
MIT Press: 2012. 

312 pp. $35, £24.95) 
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Standardize the diet 
for zebrafish model 


The standardization of diets for 
laboratory rodents in the 1970s 
minimized the contribution of 
unintended nutritional effects to 
experimental outcomes and made 
comparison between experiments 
more reliable (Nature 491, 31-33; 
2012). Despite success as a model 
species, zebrafish (Danio rerio) 
are still fed assorted commercially 
available diets of largely 

unknown nutrient composition. 
Itis time to develop a standard 
formula diet for zebrafish in the 
laboratory, applying the extensive 
knowledge of fish nutrition from 
aquaculture. 

We analysed the iron content 
of four commercial zebrafish 
diets and found that they 
contained 0.6-4.6 grams of 
iron per kilogram (gkg™') of 
dry feed. Because fish have a 
maximum iron requirement of 
0.2 gkg™ dry feed, these higher 
concentrations could be toxic. In 
salmon, for example, differences 
in dietary iron affect the 
cytochrome P450 detoxification 
system (A. Goksoyr et al. Can. 

J. Fish. Aquat. Sci. 51, 315-320; 
1994). One zebrafish diet was 
also deficient in vitamin C —a 
combination that would alter 
cellular redox status and could 
influence study parameters such 
as disease progression. 

Failure to control for such 
variables compromises the 
validity of outcomes from 
zebrafish receiving different 
nutrition in an otherwise identical 
experiment. 

Sam Penglase National Institute 
of Nutrition and Seafood Research 
(NIFES), Bergen; and University 
of Bergen, Norway. 

spe@nifes.no 

Mari Moren, Kristin Hamre 
NIFES, Bergen, Norway. 


Clean stoves already 
in use in rural India 


The health and pollution 
problems caused by primitive 
heating stoves (Nature 490, 343; 


2012) are already being addressed 
in one rural Indian community. 

In the state of Arunachal 
Pradesh in the eastern Himalayas 
— a biodiversity hot spot 
(N. Myers et al. Nature 403, 
853-858; 2000) — most people 
use biomass fuel as their primary 
source of energy. It is burnt ina 
safe, energy-efficient and smoke- 
free stove called a chulha. 

This portable iron stove 
is enclosed, equipped with 
a heat-intensity control, an 
ash-collection tray and an 
exhaust pipe. It costs just 
1,500-3,000 rupees (US$28-56), 
and has a thermal efficiency of 
60%, compared with 6-8% for 
traditional stoves. This translates 
into a significant saving of 
around 300 kilograms of wood 
fuel (biomass) equivalent per 
year (J. S. Rawat et al. Curr. Sci. 
98, 1554; 2010). 

These improvements 
have proved to be a boon for 
rural women living in poor 
socio-economic conditions. 
Sudhir Kumar Jaypee University 
of Information Technology, 
Waknaghat, Himachal Pradesh, 
India. 
sudhir.syal@juit.ac.in 


Union improves 
postdocs’ rights 


As president of the union 

UAW Local 5810, which 
represents more than 

6,000 postdocs at the University 
of California, I recognize efforts 
by the US National Postdoctoral 
Association to improve our 
working conditions (Nature 
489, 461-463; 2012). But our 
union's experience has shown 
that recommending changes is 
not enough: organizations need 
formal negotiating power to 
make them effective. 

UAW Local 5810 has 
bargained collectively for 
measurable improvements 
for postdocs. These include 
a contract with a minimum 
salary scale that matches that 
of the US National Institutes 
of Health’s National Research 


Service Award, a stable and 
comprehensive benefits plan, 
greater job security and the right 


to career-development resources. 


The parental-leave policy 
negotiated by the union should 
help to address the under- 
representation of women in 
science and engineering and 
retain top talent. Also, our 
advocacy for an increase in 
federal research funding has 
earned the support of thousands 
of postdocs and more than 
20 members of Congress from 
California. 

These hard-won successes 
should be an encouragement to 
postdocs everywhere to organize 
union support (Nature 467, 
739-741; 2010). 

Neal Sweeney University 
of California, Santa Cruz, 
California, USA. 
president@uaw5810.org 


Biomedical network 
in South America 


The organization MERCOSUR 
— dubbed the Common Market 
of the South — promotes free 
trade and movement of goods, 
people and currency within 

a trade bloc of five countries 

in South America. The 
organization has now funded 

a large biomedical network 
spanning research institutes 

in Argentina, Brazil, Paraguay 
and Uruguay. We hope that 
this unprecedented initiative 
will encourage other regional 
scientific endeavours in South 
America. 

The idea of the network is 
to help each other develop 
innovative biomedical 
projects that have potential 
for translational medicine. 

The network will encourage 
contributions from young 
investigators. 

It aims to study the biological 
and epidemiological aspects of 
diseases that have social and 
economic impact; to create 
biotechnology platforms for 
clinical developments; and to 
build up human resources and 


technology to a high standard. 
In recognition of the 
importance of investment in 
science and technology on the 
development and welfare of 
communities, MERCOSUR 
will provide US$7 million, with 
a further $3 million coming 
from national funding. The 
MERCOSUR funding will come 
from its FOCEM budget, better 
known for supporting local 
construction projects such as 
roads or hospitals. 
Eduardo Arzt* Biomedicine 
Research Institute of Buenos Aires, 
CONICET-Partner Institute of 
the Max Planck Society, Buenos 
Aires, Argentina. 
earzt@fbmc.fcen.uba.ar 
*On behalf of 4 co-authors 
(see go.nature.com/s6ud4k for 
a fulllist). 


Environmental stress 
seen since antiquity 


Lagree that biologists and 
sociologists need to get their 
acts together to determine the 
effects of environmental stress 
on our behaviour (Nature 490, 
143; 2012). You mention Francis 
Galton as the first to define its 
terms, but this discussion was 
going on long before his and 
Charles Darwin's time. 
‘Environmental determinisnY 
has been an issue for at least 
2,500 years. For example, it was 
discussed by Hippocrates and 
Strabo, by the Muslim historian 
Ibn Khaldun in his fourteenth- 
century book Muqaddimah 
and by the French philosopher 
Montesquieu in his political 
treatise The Spirit of the Laws 
in 1748. 
Frank Vereecken La Hulpe, 
Belgium. 
vereecken@yahoo.com 
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OBITUARY 


Edward Donnall Thomas 


(1920-2012) 


Immunologist who won Nobel prize for bone-marrow transplants. 


dward Donnall (Don) Thomas has 
B= called the father of bone-marrow 

transplantation. Until the 1970s, 
every reported human marrow transplant 
had failed, and prominent immunologists 
declared that the barriers between individu- 
als could never be crossed. Thomas persisted 
and eventually succeeded, sharing a Nobel 
prize for the feat in 1990. Since 1969, around 
one million patients with otherwise fatal 
blood disorders have received bone-marrow 
transplants. 

Thomas died on 20 October 2012, aged 92, 
of heart failure. He was born in Mart, Texas, 
and his father was a general practice doc- 
tor, whom he often accompanied on house 
calls. Thomas received his bachelor’s and 
master’s degrees in organic chemistry from 
the University of Texas at Austin in 1941 and 
1943, respectively. In 1942, he married fel- 
low student Dorothy (Dottie) Martin. She 
helped to manage his research and papers 
throughout his career — the late George 
Santos of Johns Hopkins University School 
of Medicine in Baltimore, Maryland, once 
said: “If Dr Thomas is the father of bone- 
marrow transplantation, then Dottie 
Thomas is the mother.” The couple had three 
children, two of whom are physicians. 

In 1946, Thomas received his MD degree 
from Harvard Medical School in Boston, 
Massachusetts, followed by residency train- 
ing at the city’s Peter Bent Brigham Hospital 
and then service in the US Army. In 1950, 
he returned to the area as a research fellow 
at the Massachusetts Institute of Technology 
in Cambridge, and then as chief resident and 
instructor in medicine at Harvard. 

In 1955, Thomas was appointed physician- 
in-chief at the Mary Imogene Bassett 
Hospital in Cooperstown, New York. Here 
he became fascinated by the discovery that 
rodents given a lethal dose of radiation could 
be rescued by an intravenous infusion of 
marrow cells from a donor. In 1957, Thomas 
treated a patient with leukaemia using high 
doses of total-body irradiation to wipe out 
the cancer, and then gave them an infusion 
of marrow cells from an identical twin. The 
transplant was at first successful, although 
the patient later died from a recurrence of 
the leukaemia. 

Meanwhile, the medical literature was 
charting numerous cases of patients with 
blood disorders who had been treated using 
marrow transplantation from healthy fam- 
ily members. All the patients died from 
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infections or severe immune reactions that 
were not predicted from studies in inbred 
rodents. Many investigators left the field, 
pronouncing it a dead end. 

Thomas did not give up. In 1957 he 
began experimenting in dogs. Like 
humans, dogs have unusual phenotypic 


diversity, a well-mixed gene pool and can 
develop haematological diseases, including 
non-Hodgkin lymphoma. In late 1963, 
Thomas set up his laboratory at the United 
States Public Health Service (USPHS) Hos- 
pital in Seattle, which was affiliated to the 
University of Washington's medical school. 

When I joined his small band of scientists 
in 1965 as a research fellow, transplantation 
was not a widely known concept. Indeed, 
the university's print shop once produced 
letterhead stationery for us that read “Divi- 
sion of Hematology and Transportation”. 
A remarkable thing about Thomas's lead- 
ership style was that he was happy to give 
people like me a free hand in innovating, as 
long as it helped the patients. 

Under Thomas's guidance, we spent the 
1960s developing high-intensity irradiation 
treatments to eradicate patients’ cancer cells 
and establishing the importance of tissue 
matching for transplant outcome. To con- 
trol and treat graft-versus-host disease, we 
developed drug combinations to suppress 
the immune system and produced antibodies 
against human lymphocytes — which once 
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involved a 1.5-hour chase of an antibody- 
producing horse. Our total-body irradiation 
sources were set up in a Second World War 
underground bunker. 

Work with patients began in 1969 at the 
USPHS hospital. Initial survival rates were 
low, and unexpected problems required 
going back and forth between bench and 
bedside — something that remained a 
hallmark of Thomas’s work. By 1979, after 
performing a number of transplants, we 
were able to describe a phenomenon called 
graft-versus-tumour effects, in which donor 
lymphocytes help to eliminate residual 
malignant cells. There are patients who 
received transplants in 1971 still alive today. 

Progress has been slow but steady. Early 
marrow donors were siblings. In 1979, the 
first patient with leukaemia was successfully 
treated with a transplant from an unrelated 
donor using new immunosuppressant drugs. 
This success sparked Thomas and others 
to set up international marrow-donor 
programmes. Transplant outcomes have 
improved, with survival rates for some dis- 
eases, such as aplastic anaemia, close to 95%. 

In 1972, the US government closed the 
USPHS hospital. This prompted the found- 
ing of the private Fred Hutchinson Cancer 
Research Center (the Hutch), with close ties 
to the University of Washington’s medical 
school. Thomas headed the medical oncol- 
ogy divisions at both. 

Over the years, Thomas’s team trained 
hundreds of young investigators. As one of 
them put it: “Virtually every major trans- 
plant centre in the world got its start by send- 
ing someone to train under Don Thomas.” 
Owing to his influence, the Hutch’s clinical 
focus has always been the patient, and its 
approach, one of teamwork. After retiring 
in 1989, Thomas continued writing manu- 
scripts, lecturing and serving as an ambas- 
sador for the place. 

Besides science, Don and Dottie had a 
passion for fishing and hunting, the fruits of 
which they shared at intimate dinners with 
colleagues and friends at their modest home. 
Reserved, hardworking and uncompromis- 
ing, Thomas generously attributed much of 
his success to colleagues, nurses, support 
staff, patients and their families. m 
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Birds of a feather 


A phylogenetic reconstruction of the diversification of birds across space and time provides a novel resource for 
evolutionary studies. But the methods used to construct this tree, and what insights can be inferred from it, are 
a source of debate. Two evolutionary biologists provide opinions on how to draw the lines. SEE LETTER P.444 


THE PAPER IN BRIEF 

@ Geographical and ecological features, such 
as climate and niche opportunities, influence 
the evolutionary processes that generate 
new species. 

@ Jetz et al.’ (page 444 of this issue) 
combined genetic and taxonomic 
information to construct a phylogenetic 

tree of the almost 10,000 species of 

extant birds that also considers 


Disconnects in 
diversity 
ROBERT E. RICKLEFS 


ver since biologists began cataloguing the 

diversity of life on Earth, they have sought 
to understand the origin and maintenance of 
global patterns of species richness — for exam- 
ple, that life is most diverse where the climate 
is warm and wet or where mountains vary the 
landscape. Biological diversity reflects a bal- 
ance between the tendency of evolutionary 
lineages to form new species and the variety of 
living things that an environment can support. 
Furthermore, diversification depends on the 
availability of space, the dispersal of life forms 
among shifting continents, and variations in 
climate and resources at different locations and 
over time. Therefore, to understand the distri- 
bution of diversity is to interpret evolutionary 
diversification in historical and geographi- 
cal contexts — and this is the key to Jetz and 
colleagues’ remarkable accomplishment. 

By integrating their reconstruction of the 
ancestral relationships of all of the approxi- 
mately 10,000 known extant species of bird 
with maps of their distributions, Jetz et al. 
derived a detailed picture of average diversifi- 
cation rates over the surface of the globe. The 
patterns revealed are intriguing. One might 
expect more species where the diversifica- 
tion rate is higher, but the authors find that 


*This article and the paper under discussion’ were 
published online on 31 October 2012. 


their historical locations*. 

@ The authors report that the diversification 
rates of bird species vary across the globe, 
with greater differences in rates between 

the Eastern and Western hemispheres than 
across latitudinal lines. 

@ The phylogeny also reveals ‘hot spots’ of 
recent diversification in regions characterized 
by strong climatic fluctuation over the past 

5 million years. 


diversity and diversification rate correspond 
poorly around the planet — evolutionary 
lineages split more frequently, on average, in 
the Western than the Eastern Hemisphere, 
but not in the tropics compared with higher 
latitudes. They also find that the overall diver- 
sification rate is higher in passerines (song- 
birds) than in non-passerines (ducks, raptors, 
shorebirds and others), as expected given the 
species richness of the former group. But, 
surprisingly, the data show that the relative 
contribution of each group to the diversifi- 
cation rate differs between regions. Particu- 
larly intriguing is the relative evolutionary 
quiescence of modern passerines in Australia 
and New Guinea, where the passerines, now 
the largest group of birds, originated around 
60 million years ago” (Fig. 1). 

Biologists will debate whether Jetz and 
colleagues’ phylogenetic reconstruction is up to 
the task. Although some DNA-sequence infor- 
mation is available for around two-thirds of bird 
species, the genomes of only a few have been 
well sampled. Jetz et al. built their phylogeny on 
a backbone of 158 major bird clades whose rela- 
tionships had previously 
been defined? (a clade 
represents a ‘branch ofa 
phylogenetic tree, includ- 
ing an ancestor and all its 
descendants). To this, 
they attached new phy- 
logenetic detail, using a 
complex algorithm based 
on taxonomic distinc- 
tions to determine the 
placement of species for 
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which sequence data are not available. 

The result is perhaps not perfect, but it is 
probably the best possible for now, and is 
certainly the most ambitious. Moreover, the 
sequence data that would be required for a sub- 
stantial improvement might not be worth the 
effort, because any inference on rates of species 
production depends on how we circumscribe 
species. For example, the authors’ method for 
calculating diversification rate — on the basis 
of the lengths of branches close to the tips of 
the phylogeny — may provide inflated esti- 
mates in regions in which bird populations 
are finely distinguished at the species level, 
such as Europe and North America. So addi- 
tional sequencing without taxonomic revision 

might be an empty 


“The result is pat 

perhaps not The apparent 

perfect, but it absence of a latitu- 

Jour0b a bly the dinal gradient in the 
Pp 4 diversification rates 

pou — of birds is consistent 


with a recent analysis 
for mammals’. This 
result implies that the great species richness of 
tropical environments is a matter of age: Earth 
was mostly tropical before temperate and boreal 
environments began to expand around 30 mil- 
lion years ago. However, an assessment based 
on a different phylogenetic approach’ con- 
cluded not only that the recent speciation and 
extinction rates of birds both increase towards 
higher latitudes, but also that the difference 
between the two (the diversification rate) has 
nevertheless been higher in the tropics. These 
conflicting interpretations and other enig- 
matic patterns arising from Jetz and colleagues’ 
new phylogeny — for example, that extinction 
compared with speciation has been relatively 
infrequent, or that the overall rate of diversifi- 
cation has increased towards the present, par- 
ticularly with the expansion of cold and arid 
environments — will motivate further work for 
some time to come. 
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Figure 1 | Birds in space and time. These Cape sparrows (Passer melanurus) are 
one of around 5,000 species of songbird (passerine), which make up around half 
of the known species of bird. Jetz and colleagues’ have constructed a phylogenetic 


First steps 
for birds 


MARK PAGEL 


etz and colleagues’ bird phylogeny joins 
Jecvcral other attempts to reconstruct the 
history of entire classes of organism, including 
the 5,000 or so mammals’ and the roughly 6,000 
amphibian species’. These large phylogenetic 
trees have a value that extends beyond describ- 
ing the evolutionary relationships among a 
group of organisms: they grant unprecedented 
statistical power to attempts to reconstruct 
the probable historical events and processes 
of evolution’, such as our understanding of 
ancestral states, or rates of morphological 
change and speciation. This statistical power 
allows researchers greater confidence in ruling 
some proposals in and others out. But it also 
means that, if the phylogeny is wrong, it might 
confidently return wrong, biased or misleading 
answers to tests of evolutionary questions. 

And this is why we must receive Jetz and 
colleagues’ tree with a measure of caution. 
Rather than seeking to infer the avian tree 
from gene-sequence or other information, 
such as data on morphology and behaviours, 
Jetz et al. relied on the findings of previous 
studies to fix the tree's major outlines and the 
broad placement of most ofits species. In fact, 
the authors assigned positions in the tree to 
roughly one-third of the bird species on the 
basis of previous taxonomic classifications 
alone. As a consequence, the authors’ method 
never explores the possible universe of avian 
relatedness, and so we are left wondering 
whether there might be other trees that pro- 
vide equally good or even better descriptions 


= 4 —-= 


of avian evolution, given what we know about 
birds’ genetic, morphological and behavioural 
similarities and differences. 

The authors could have avoided this 
nagging worry by gathering data on as many 
species as possible and then inferring the tree 
by a more conventional route that did not place 
such strong prior constraints on the outcome. 
Curiously, for instance, Jetz et al. seem to have 
ignored recently published gene-sequence 
data’ on around 4,000 passerine bird species 
that were used to infer a tree of this group — 
the largest within the class Aves. 

Jetz and colleagues press their tree into 
service to study avian speciation rates, conclud- 
ing that these rates have increased through time 
and that they are, for 
example, higher in 


{3 
Bo eny is the Western than the 
wrong it mi ght Eastern hemisphere, 
aes with latitude generally 
confi dently having a smaller influ- 
return biased ence. These are the 
or misleading sorts of broad ques- 
answers to testS tions that such large 
of evolutionary trees should be used 
questions. ” to test, but here we 


must not lose sight of 
the statistical power this tree grants. For exam- 
ple, the authors’ methods for estimating specia- 
tion rates depend on how well one can estimate 
the lengths of the branches of the phylogeny 
in units of time. As has been previously dis- 
cussed”, there are reasons to be cautious about 
the branch lengths that are returned by methods 
used to infer time-dated trees, and even small 
biases in branch-length estimation, when inte- 
grated over so many species, can produce appar- 
ent trends that may or may not be real. 
So it is difficult to know what to make, in 
evolutionary terms, of the ‘hemisphere’ effect 


tree of all known avian species and mapped this to spatial data of species 


distribution to assess how bird diversity and diversification rates compare at 
different times in history and in different regions. 
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the authors report. It is not just that there are 
questions about the tree and the estimation of 
speciation rates — hemisphere boundaries are 
arbitrary constructions, and the regions they 
separate, whether north-south or east-west, are 
vast and each harbour a wealth of ecological and 
climatic conditions. Furthermore, when seeking 
associations between characteristics of species, 
or between characteristics of species and their 
environments, it is vital to identify multiple 
evolutionarily independent instances of the two 
traits changing in tandem". But Jetz et al. have 
not done this for the features of climate and ecol- 
ogy that they suggest influence speciation rates. 

So these are still ‘first steps’ towards a 
phylogeny of birds and our understanding of 
their rates of speciation. The important ques- 
tions Jetz and colleagues raise invite careful 
second steps towards confirming or refuting 
their proposals in this tricky area. m 


Mark Pagel is in the School of Biological 
Sciences, University of Reading, 
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CLIMATE SCIENCE 


Historical drought 
trends revisited 


Anew assessment of drought trends over the past 60 years finds little evidence 
of an expansion of the area affected by droughts, contradicting several previous 


estimates. SEE LETTER P.435 


SONIA I. SENEVIRATNE 


r | Vhe 2011-12 drought in the United States 
was reported as one of the most severe 
on record’, and led to economic losses 

of billions of dollars**. When such extreme 

events occur (Fig. 1), a common question 
is whether they might be a result of climate 
change’. Not only is this question far from 
trivial’, but it is also more complex for 
droughts than for most other climate extremes. 

On page 435 of this issue, Sheffield et al.° 

report an overestimation of historical drought 

trends obtained using a method that served as 
the basis for historical drought assessments 
made in the Fourth Assessment Report of the 

Intergovernmental Panel on Climate Change 

(IPCC)’*, as well as for analyses in more recent 

publications”. 

The authors’ results suggest a high uncer- 
tainty in global-scale drought trends over the 
past 60 years and little evidence of an increase 
in the total area affected by droughts. A recent 
IPCC assessment’ highlighted strong uncer- 
tainties in historical drought records and, in 
particular, inconsistent conclusions of studies 
carried out after the Fourth Assessment Report 
with respect to some regional drought trends. 
Sheffield and colleagues’ results are consist- 
ent with this assessment, but also suggest that 
methodological issues may partly explain 
reported conflicting results in the literature. 

Soil-moisture drought, which is of most 


relevance to agriculture, is induced by a deficit 
in the land water balance and is caused by lack 
of precipitation and/or excess evapotranspira- 
tion’ (Fig. 2). Evapotranspiration refers to the 
moisture loss from soils, either through plant 
transpiration (water extracted by the plants 
and lost through the leaves’ stomata) or by 
direct evaporation from moist surfaces (such 
as bare soils, lakes, rivers, or water stored on 
top of leaves). 

The key point addressed by Sheffield and 
colleagues is the contribution of evapo- 
transpiration as a driver for droughts. Most 
approaches compute actual evapotranspiration 
as a function of potential evaporation — that 
is, the evaporation occurring from bodies of 
water. In their study, the authors evaluate the 
extent to which different formulations used for 
computing potential evaporation can affect 
resulting estimates of historical drought trends 
that are computed by means of a drought indi- 
cator called the Palmer Drought Severity Index 
(PDSI). It should be noted that evapotranspi- 
ration depends not only on potential evapo- 
ration in the PDSI but also on soil-moisture 
availability”, an aspect not directly assessed 
by the authors. 

The PDSI approach is commonly used to 
assess drought trends’, although its validity 
for such applications is questionable because 
of several issues”. One of these is the usual 
practice of estimating potential evaporation 
as if it were solely dependent on temperature 


Figure 1 | Severe drought. A field of dried maize (corn) plants near Percival, Iowa, during the severe 
drought that struck the United States in 2012. 
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and latitude — an estimation known as the 
Thornthwaite-based formulation. This formu- 
lation neglects the role of several other climate 
drivers of potential evaporation, in particular 
wind speed, relative humidity and solar and 
long-wave radiation (Fig. 2). In addition, using 
temperature as a driver for drought overlooks 
the fact that, in dry conditions, the causal 
link is often reversed — that is, drought itself 
induces hot temperatures when the lack of soil 
moisture leads to a suppression of evaporative 
cooling”. 

In earlier studies, the rationale for using a 
formulation that relied only on temperature 
was the absence of credible long-term global 
data sets of other driving variables’. Sheffield 
et al. used a recently compiled data set for these 
variables to assess their impact on trends in 
potential evaporation and resulting drought 
computed using the PDSI. By applying the 
Penman-Monteith formulation, which con- 
siders these other drivers, the authors find 
much weaker trends than with the Thorn- 
thwaite formulation, and little evidence of an 
expansion of the area affected by drought in 
past decades’"°. 

The authors provide detailed evaluations of 
their results, as well as thorough explanations 
for the apparent conflict with previous stud- 
ies”"' that found few differences between PDSI 
drought trends obtained with the Penman- 
Monteith and Thornthwaite formulations 
(see the Supplementary Information to the 
paper®). On the basis of this analysis, the main 
discrepancies in these previous studies””’ 
seem to be rooted in slight differences in 
methodological approach, for example dif- 
ferences in the calibration periods used or the 
atmospheric forcing (data-set choice; data- 
gap filling, in some cases with climatological 
data; and consideration of spurious trends in 
data sets). Clearly, the quality and reliability of 
the forcing data sets° and lack of observations” 
remain an issue for any assessment related to 
drought trends. Because of these uncertainties, 
Sheffield and colleagues’ investigation has its 
own limitations, as the authors themselves rec- 
ognize. Therefore, their results will also need to 
be confirmed by other independent analyses. 

The 2007 IPCC Fourth Assessment Report's 
conclusion* that the area affected by droughts 
was “likely” to have increased in many regions 
since the 1970s had already been revised in a 
more recent IPCC special report on extreme 
events and disasters published earlier this year 
(the SREX Report)”. This report assessed that 
“[t]here are still large uncertainties regarding 
observed global-scale trends in droughts’, and 
highlighted regions in which drought trends 
have increased (southern Europe and West 
Africa) as well as those showing decreasing 
trends (central North America and north- 
western Australia)’. These reported regional 
trends agree with Sheffield and colleagues’ 
results. Furthermore, the SREX Report did not 
provide any assessment of previous changes 


Critical precipitation deficit 
Meteorological drought 
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Figure 2 | Drought drivers. Several factors, apart from temperature, affect the development of droughts. 
These should all be taken into account for assessing historical drought trends and the contribution of 
anthropogenic climate change to recent drought events. Red arrows indicate factors that contribute to 
drought, and blue arrows show factors that counteract it. Sheffield et al. find that results obtained for 
historical trends in the global land area affected by drought (even possibly affecting the sign of these trends) 
are heavily influenced by the meteorological drivers chosen to estimate them. (Adapted from ref. 5.) 


in the areas affected by droughts. Given 
Sheffield and colleagues’ findings, this metric 
of drought (that is, the total land area affected 
by drought) seems rather ill defined, because 
the error range of their Penman-Monteith- 
based PDSI estimates does not exclude either 
positive or negative trends in this quantity. 

The authors’ results confirm the complex- 
ity of the processes that lead to changes in 
drought conditions, also discussed in the 
SREX Report*. The findings imply that there 
is no necessary correlation between tempera- 
ture changes and long-term drought varia- 
tions, which should warn us against using any 
simplifications regarding their relationship. 
Furthermore, apart from the variables con- 
sidered in potential evaporation, the PDSI has 
several other shortcomings, in particular those 
resulting from the simplicity of its water-bal- 
ance model*””. These may explain a reported 
tendency of the PDSI approach (even when 
using the Penman-Monteith formulation’) 
to overestimate future drying trends when 
driven with climate-model output, compared 
with other estimates, including soil-moisture 
output of the climate models themselves. 
Future investigations should carefully consider 
these uncertainties, for instance in the context 
of palaeoclimate studies® and when relating 
specific changes in global mean temperature 
(for example, for the commonly discussed 2 °C 
target’’) to their effects on drought. = 
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ACTIVE MATTER 


Spontaneous flows and 
self-propelled drops 


The construction of in vitro assemblies of biological components that exhibit 
properties of living matter may shed light on the physical aspects of the dynamic 
reorganization that continuously occurs inside cells. SEE LETTER P.431 


M. CRISTINA MARCHETTI 


Le and gels can be made to flow by 


applying external forces at their bounda- 
ries. In this issue, Sanchez et al. report 
the observation of self-sustained flows that 
occur in the absence of external forces — a 
hallmark of living systems — in a model gel. 
When this ‘active’ gel is confined to the inte- 
rior of water droplets in a water-oil emulsion, 
the flows resemble the streaming used by cells 
to circulate their fluid content. Even more 
remarkable is the fact that, when one of these 
gel-filled droplets comes into contact with a 
hard surface, the self-driven flows of the con- 
fined gel drive the droplet along the surface*. 
To build their gel, Sanchez et al. sequen- 
tially assembled ingredients extracted from 
cells (Fig. 1). The first — and key — com- 
ponents are microtubules. These stiff, cylin- 
drical filaments are one of the constituents 
of the cytoskeleton, the polymer network 
that mediates force transmission and motil- 
ity in cells. Microtubule dynamics in cells is 
regulated by several proteins. Among these 
is kinesin, a motor protein capable of ‘walk- 
ing’ on individual microtubules by converting 
chemical energy from ATP fuel molecules into 


Kinesin 


Streptavidin 7? 


Polymer coil 


mechanical work. To construct the active units 
of their gel, the authors used a protein called 
streptavidin as a scaffold to assemble clusters 
of kinesins that could simultaneously bind to 
multiple microtubules. 

Finally, Sanchez et al. added nanometre- 
sized polymer coils to the solution. This step 
was essential to promote the formation of 
microtubule bundles that, in the presence 
of ATP, are continuously remodelled by the 
action of the crosslinking motor proteins. 
The coils induce attractive forces between 
the microtubules through a mechanism 
known as depletion interaction. This inter- 
action arises when two filaments come near 
to each other, because the narrow gap between 
them is no longer accessible to the polymer 
coils. This creates an osmotic pressure dif- 
ference that effectively acts as an attractive 
force between the filaments’. Sanchez and 
colleagues’ overall hierarchical assembly pro- 
cess was recently used by the same group to 
build artificial cilia that beat periodically and, 
when densely packed on a substrate, sponta- 
neously synchronize their beating pattern to 
create travelling waves’. 

At a moderate density, the microtubule 
bundles form a polymer network that is 


Figure 1 | Assembly of microtubule bundles. a, Sanchez et al.' combined the protein streptavidin with 
kinesin motor proteins that had been modified to bind to streptavidin (modification not shown). 

The proteins self-assembled to form clusters of several kinesin molecules in complex with streptavidin. 
b, The authors then added microtubule filaments and polymer coils to the mix. The polymer coils 
generated ‘depletio forces that pushed the microtubules together, promoting the formation of 
microtubule bundles mediated by the kinesin clusters. The bundles formed the basis of an ‘active’ gel 
—a material that generated self-sustained, internal flows of fluid in the absence of external forces. 
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internally driven by the action of kinesins. 
The network flows spontaneously and exhib- 
its mixing and enhanced transport, compared 
with its non-active counterpart (which is 
obtained when the ATP fuel runs out); this was 
demonstrated by the authors by tracking small 
particles suspended in the gel. 

At scales much larger than the typical bundle 
length (tens of micrometres), the rich dynam- 
ics of the system resembles that of complex 
fluids such as liquid crystals driven by exter- 
nally applied fields, but differs from them in 
that it occurs spontaneously as a result of the 
internal drive. This is the key property of active 
materials that are driven out of equilibrium 
not by forces applied at their boundaries, but 
rather by an input of energy on each unit, as 
in a suspension of swimming bacteria. Energy 
uptake at the microscopic scale is crucial for 
driving emergent phenomena and self-organ- 
ization in disparate systems* — from naturally 
occurring ones, such as bacterial suspensions 
and flocks of birds, to chemical and mechani- 
cal analogues, such as self-propelled Janus 
colloids (microscopic particles that have two 
faces with distinct properties). 

When Sanchez et al. confined the micro- 
tubule network to a water-oil interface, the 
resulting dense, two-dimensional film again 
exhibited self-sustained streaming flows 
that seemed to be associated with bundle 
fracturing and healing. The complex dynam- 
ics yielded patterns resembling topological 
defects — structures that can be generated in 
liquid crystals at equilibrium by confinement 
or external drive. 

Finally, when the researchers confined 
the active gel to droplets of at least 30 micro- 
metres in diameter, the gel was spontaneously 
adsorbed to the inner surface of the drop- 
lets, turning into a two-dimensional active 
film on a curved substrate. Remarkably, the 
self-sustained active flows of the trapped gel 
drove autonomous movement of the droplet 
ona substrate. Although the motile droplets 
moved along somewhat circular trajectories, 
rather than travelling in a straight line, they 
covered about 250 micrometres in 33 minutes. 
These moving drops bring to mind recent 
theoretical work showing that active drops in 
a fluid spontaneously acquire directed motil- 
ity. For drops in which the active constituents 
— the microtubule bundles in Sanchez and 
colleagues’ work — form large domains and 
have, on average, a common orientation but no 
preferred direction, the theory indeed predicts 
rotational motion of the drops. 

Reconstituted microtubule-kinesin sys- 
tems have been explored before as models for 
active self-assembly, not least in the remark- 
able experiments” that led the way to current 
studies of pattern formation in active systems. 
In those experiments, kinesin complexes 
driven by ATP organized microtubules into 


*This article and the paper under discussion’ were 
published online on 7 November 2012. 
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spirals and asters reminiscent of a cell’s mitotic 
spindle, a star-like microtubule assembly that 
mediates cell division. One important differ- 
ence is that the structures seen in the earlier 
work®” were essentially static, whereas Sanchez 
and colleagues’ microtubule gel generates con- 
tinuously evolving, spontaneous flows that 
persist as long as ATP is present — not unlike 
what happens in living cells. Furthermore, 
Sanchez et al. report that the internally gener- 
ated flows in their active gel can be tuned by 
varying the ATP concentration, confirming 
the self-sustained, non-equilibrium nature 
of the dynamics. The fact that microtubules 
are assembled into bundles seems to be essen- 
tial for yielding self-sustained motion (see 

Movie S2 in the Supple- 
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go.nature.com/ougib8 an open question. Also 


ANIMAL BEHAVIOUR 


unexplained is why the behaviour of the active 
microtubule network is so different from that 
of gels composed of actin filaments and myo- 
sin motor proteins, in which activity yields 
spontaneous contraction®. 

Sanchez and colleagues’ work is a beautiful 
example of a growing class of experiment in 
biomimetic assembly, aimed at building sys- 
tems that exhibit some of the features of liv- 
ing matter. Will it be possible to control and 
direct the motility of the active droplets? And 
can the flow-induced structures be harnessed 
and used as guides for the transport of parti- 
cles through fluid, as those in cells are? This 
remains to be seen. Meanwhile, experiments 
of this type are beginning to shed light on 
the physical aspects of the complex dynami- 
cal reorganization that occurs continuously 
inside cells. When combined with studies of 
the biochemical machinery and signalling that 
drive such reorganization, they may ultimately 


Personality in the wild 


Behavioural traits can influence an individual animal’s fitness, and trait 
combinations can change over its lifetime, according to a study of wild trout 
during a key period in their development. 


ALISON M. BELL 


L: is hard for a young brown trout in 


a cold Swedish stream. There are so 

many dangers to watch out for, such as a 
hungry mink lurking around the bend, and so 
many things to do, such as competing for food. 
Indeed, a young brown trout has only around 
a 10% chance of surviving to adulthood’. Ifa 
fish can beat the odds and survive this danger- 
ous period, it emerges different from before 
— not just bigger, but also behaviourally 
changed. A recent paper by Adriaenssens 
and Johnsson in Ecology Letters’ reports 
that the individuals that make it through 
this bottleneck behave more predictably 
across contexts than they did before. Is 
this because of those harrowing early 
experiences? Or are these fish the ones 
that were better adapted in the first place? 
Adriaenssens and Johnsson’s findings 
suggest that the answer is an intriguing 
combination of both factors. 

Animal personalities are interesting to 
researchers because behaviour is notori- 
ously flexible — unlike most morphologi- 
cal traits, behaviour can change almost 
instantaneously. Within seconds, a fish 
might go from aggressively attacking an 
intruder to foraging alone in the middle of 
the stream. But there is growing evidence 
that behaviour does not always change 


at a moment’s notice, and that animals have 
distinctive personalities that they retain over 
time. One view’ is that animal personalities 
may result from constraints: limiting mecha- 
nisms that prevent an individual from being 
able to change, such as a genetic propensity. 
Analternative interpretation’ is that consistent 
differences in behaviour between individuals 
might be the result of adaptation through 
natural selection. There is evidence for both 
the constraint® and the adaptive® models, 


Figure 1 | Fishy activity. Adriaenssens and Johnsson’s 
study’ of the behaviour of wild brown trout (Salmo trutta) 

shows that individuals with consistently high activity levels 
are more likely to survive the early months of life. 
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lead to a quantitative understanding of the 
mechanics of living matter. m 
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and strong support for the latter comes from 
research showing that behavioural consistency 
arises from both behavioural plasticity (when 
individuals change their behaviour in response 
to environmental conditions) and non- 
random survival of individuals’. However, 
that study was carried out in the lab, where 
life is relatively simple. The significance of 
Adriaenssens and Johnsson’s work is that it 
starts to show us how adaptive personalities 
can emerge in the wild. 

The authors captured young (around two 
and half months old) brown trout (Salmo 
trutta; Fig. 1) ina stream in western Sweden 
and gave each individual a unique colour mark. 
The trout were then put through a series of 
behavioural assays in the lab. One of these was 
an ‘open-field test; in which trout were indi- 
vidually placed in an open arena and observed 
to determine whether they were the kind of fish 
that explores everything, or the type that moves 
little and hunkers down in one spot. Another 
assay involved a confrontation with an 
opponent — in this case, the trout’s own 
reflection in a mirror. Here, the researchers 
were looking to see whether the individual 
attacked the intruder or if it was relatively 
non-aggressive. After assessing each fish 
in all of the assays, the researchers released 
them back into the stream. 

Two months later, Adriaenssens and 
Johnsson returned to the stream. Of the 
81 individuals that were tested, they recap- 
tured 28. On the basis of the assumption 
that those fish that were not recaptured 
had died, the authors’ analyses showed 
that an individual's behaviour predicted 
its survival: trout that had been very active 
in the open-field test were more likely to 
survive to 4.5 months of age than those 
that had moved around less. An alterna- 
tive explanation would be that the inactive 
individuals did not die, but rather were 
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harder to recapture or more likely to disperse 
out of the study area. However, Adriaenssens 
and Johnsson provide evidence against both of 
these possibilities, showing that inactive fish 
were in fact easier to catch and did not move 
as far in the stream. So the first notable result 
from this study is that it shows natural selec- 
tion acting on differences in behaviour among 
individuals in a wild population. 

The authors then put the recaptured fish 
through the same assays and found that their 
behaviour had changed during their time 
back in the stream — the trout were more 
active at 4.5 months of age than they had been 
at 2.5 months. However, despite this overall 
behavioural change and the vagaries of life in 
the wild, the survivors retained their relative 
personality traits: the fish that had been the 
most active in the first round of testing, for 
example, were still the most active. 

Finally, and perhaps most intriguingly, 
Adriaenssens and Johnsson report that the 
individuals’ behaviour became more distinc- 
tive. Whereas the young trout did not behave 
consistently across the different behavioural 
assays, the older trout did, such that highly 


exploratory individuals were now also more 
aggressive. In other words, the individuals 
were now more predictable, and a ‘behavioural 
syndrome” linking exploratory behaviour to 
aggressive behaviour had emerged. 

There are two ways in which this could have 
occurred. First, it may be that only those indi- 
viduals that were relatively exploratory and 
aggressive, or relatively non-exploratory and 
non-aggressive, in the first place survived. 
Alternatively, it is possible that individuals 
changed their behaviour over time and this 
caused different behaviours to become coupled 
together. Adriaenssens and Johnsson present 
evidence in support of both mechanisms, but 
they were unable to thoroughly disentangle 
the two; designing experiments that can tease 
these two processes apart is a pressing goal for 
future work. Further studies are also needed to 
determine whether there are consequences of 
such non-random survival in the next genera- 
tion — in other words, if behavioural variation 
is heritable. 

The biggest question, though, is why did 
individual fish become predictable? That is, 
why did behaviours become coupled together, 


Complexion matters 


Sun exposure indisputably increases the risk of skin cancer. Mouse studies 
suggest that, in red-haired individuals, genetic factors also contribute through a 
mechanism that acts independently of exposure to sunlight. SEE LETTER P.449 


MIZUHO FUKUNAGA-KALABIS 
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he most common environmental risk 
factor for skin cancer is overexposure 
to sunlight. But is protecting skin from 
the sun enough to prevent cancer? In this issue, 
Mitra et al.’ address this question using mice 
that are genetically similar to humans who 
have red hair and fair skin or to dark-skinned 
or albino individuals. They find that the often 
deadly skin cancer melanoma occurs more fre- 
quently in ‘redheaded’ mice than in the other 
two groups, owing to mechanisms that are 
unrelated to exposure to ultraviolet light*. 
Colours of skin, eyes and hair vary widely 
among humans. This variation is controlled 
by the amount and ratio of two forms of the 
pigment melanin (the red—yellow pheomela- 
nin and the brown-black eumelanin)’. Both 
pigment types are produced by melanocytes 
— cells that are located in the basal layer of the 
skin epidermis, in hair follicles and in the uvea 
of the eye. Melanocyte-stimulating hormone 
binds to the melanocortin 1 receptor (MCIR) 


*This article and the paper under discussion’ were 
published online on 31 October 2012. 


on the surface of melanocytes, initiating a 
biochemical cascade that leads to increased 
levels of the enzymes required for eumelanin 
synthesis. Disturbance of the MC1R-mediated 
signalling pathway reduces total melanin pro- 
duction but increases the relative abundance 
of pheomelanins — similar to what happens 
when MCI1R is not activated’. 

In humans, the MC1R gene can show great 
variability (polymorphism) in sequence. Cer- 
tain polymorphisms in this gene are associated 
with red hair colour (RHC) and so are called 
RHC variants. These variants result in the loss 
of functional MCI1R, and individuals carrying 
them often have red hair, fair skin, a tendency 
to freckle and little ability to tan*. 

Caucasians are generally at a higher risk of 
developing melanoma than are non-Cauca- 
sians, and red-haired individuals in particular 
are more susceptible to melanoma than are 
those with other hair colours’” (Fig. 1). 
Moreover, epidemiological analyses from 
pooled data sets indicate* that RHC vari- 
ants of MC1R are associated with melanoma 
risk. These observations are not surprising, 

as red hair is rich in pheomelanin and has 
little eumelanin. Because eumelanin shields 
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and what are the fitness advantages to being 
a trout that behaves consistently? Several 
hypotheses exist to explain why behaviours 
should come packaged together® — for exam- 
ple, that there might be social benefits of 
being predictable’. But so far there have been 
few empirical tests of these ideas, and this 
represents another exciting challenge for 
future work. m 
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the skin by absorbing ultraviolet (UV) rays*, 
the skin of red-haired people has a particular 
tendency to accumulate light-induced damage. 

Mitra et al. show that completely avoiding 
UV rays would not protect red-haired people 
from melanoma. As an animal model of red- 
haired individuals, the authors used mice with 
mutated, non-functional Mclr, and thus with 
melanocytes that cannot induce eumelanin 
synthesis. Whereas control mice had a dark- 
brown, nearly black coat colour, the Mclr- 
mutant mice had golden-yellow coats. A 
third group of mice expressed non-functional 
tyrosinase — an enzyme essential for both 
eumelanin and pheomelanin synthesis — 
and had white coats, mimicking albinism in 
humans. These ‘white’ mice and the ‘black’ 
mice possessed intact Mcl1r. 

Mitra et al. crossed their differently 
coloured mice with mice that expressed 
BRaf‘®** — one of the most common gene 
mutations in melanoma — in their melano- 
cytes’. BRAFY® is carried by 40-60% of 
patients with this cancer’’. Furthermore, 
patients with melanoma who carry RHC vari- 
ants also have a high frequency of BRAF muta- 
tions”, suggesting that BRAF mutations have 
a role in the development of melanoma in the 
red-haired population. 

When BRaf*” expression was induced in 
melanocytes, the black mice and their white 
counterparts developed melanoma only 
at low rates and after long periods of time. 
By contrast, more than 50% of the redhead 
mice developed this cancer within a year of 
BRaf‘®* induction, despite being kept in a 
UV-free environment. Surprisingly, block- 
ing pheomelanin synthesis not only gave 
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Figure 1 | Genetics and the sun. Individuals with fair skin and red hair are more susceptible to 
melanoma than people who have darker complexions. Mitra et al.’ show that this may be due not 
only to their greater sensitivity to sun exposure, but also to variants in their MCIR gene. 


the redhead mice a white coat colour, it also 
reduced the incidence of melanoma among 
them. These results strongly indicate that 
the pheomelanin synthesis pathway plays an 
important part in melanoma development. 

Why do redhead mice develop melanoma 
in the absence of UV light? It seems that red- 
heads carrying RHC variants have a higher 
risk of melanoma because of intrinsic oxida- 
tive DNA damage, in addition to their poor 
protection from UV. Eumelanin is a strong 
antioxidant and reduces the accumulation of 
DNA damage by absorbing reactive oxygen 
species (ROS). Although pheomelanin may 
increase cancer risk by generating ROS in 
response to UV exposure”, this pigment — or 
its chemical intermediates — can also gener- 
ate ROS through a mechanism independent 
of UV radiation’*"*. The present paper shows 
that ROS-mediated damage to both DNA and 
lipids accumulates more readily in the skins 
of Mclr-mutant redhead mice than in Mclr- 
mutant white mice, even without UV expo- 
sure. Moreover, any such exposure seems to 
exacerbate oxidative damage selectively in 
redhead mice’. 

Studies are under way to determine 
whether the increased melanoma risk in 
redheads is limited only to cancers driven by 
mutant BRAF, or whether it also applies to 
other melanoma oncogenes such as NRAS. 
In addition, it remains to be confirmed 
whether the risk of this cancer increases in 
individuals whose red hair is conferred by 
polymorphisms in other genes of the pigment 
pathway. 

But perhaps the most pertinent question is 
what can be done, beyond sun protection, to 
decrease melanoma risk in red-haired, fair- 
skinned individuals? Destroying all melanin 
is not an option, because the skin would lack 
a natural sunshield — eumelanins. Besides, 
Mitra and co-workers’ black mice were 


relatively well protected against melanoma 
even though they possessed both pheomela- 
nin and eumelanin. It is conceivable that, in 
these animals, the abundant eumelanin scav- 
enges pheomelanin-derived ROS. Therefore, 
it would be of great interest to determine 
whether topical compounds that induce 
eumelanin synthesis (such as forskolin), or oral 
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intake of antioxidants, decrease melanoma risk 
in redheads. Moreover, red-haired individuals 
should undergo frequent dermatological skin 
checks, besides avoiding sun exposure. = 
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Putting a spinon 
photon entanglement 


Entanglement between a photon and a stationary particle is a key resource for 
quantum communication. The effect has now been observed for a photon and 
asingle electron spin in a semiconductor nanostructure. SEE LETTERS P.421 & P.426 


SOPHIA E. ECONOMOU 


uantum mechanics, the theory that 
explains the properties of matter through 
the motion of its microscopic constitu- 
ents, is known for its peculiar and counter- 
intuitive concepts. These concepts are not only 
crucial to the structure of the theory, but also 
form the basis of many technologies, includ- 
ing lasers, the scanning tunnelling microscope 
and atomic clocks. In this issue, De Greve et al.’ 
and Gao et al.” describe how they have inde- 
pendently demonstrated one such concept — 
quantum entanglement — in a semiconductor 
nanostructure known as a quantum dot. 
Entanglement is arguably the most striking 
feature of quantum mechanics. Two systems, 
A and B, are said to be entangled if their 
quantum states are correlated — that is, not 


separable. This means that although we cannot 
predict what state system A will be in when 
it is measured, we can predict with certainty 
the measurement outcome of system B once 
A is measured, even if the two systems are 
separated by an arbitrarily large distance. This 
feature famously unsettled Einstein — who 
deemed it “spooky action at a distance” — and 
led to the renowned ‘EPR paper questioning 
the completeness of quantum theory’. 

In recent decades, with the advent of 
quantum information science’, entanglement 
has evolved from being considered a counter- 
intuitive curiosity of quantum theory to being 
recognized as a key resource for quantum com- 
munication and computation. In particular, 
entanglement between a stationary quantum 
system anda ‘flying’ quantum system, such asa 
photon, has the potential to allow long-distance, 
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secure communication and entanglement 
between two stationary systems that have 
never ‘met’ but that become linked by the fly- 
ing system. Therefore, if quantum information 
processing is to become a practical reality, it is 
crucial to generate and quantify entanglement 
between viable physical systems. In recent years, 
such entanglement has been shown for trapped 
ions’ and for certain defect states in diamond 
called nitrogen-vacancy centres®. 

Now, for the first time, De Greve et al. and 
Gao et al. demonstrate entanglement between 
a photon and a single electron spin trapped ina 
quantum dot. Formed at the interface between 
two semiconductors, the quantum dot has 
the ability to trap single electrons. Electrons 
confined in quantum dots remain in discrete, 
atomic-like spatial states without the need for 
external electromagnetic traps, and they have 
well-defined quantum properties. Once an 
electron is trapped in the dot, experimenters 
can focus on its spin, which can be in two dis- 
tinct states (“‘up’ or down) or, according to the 
superposition principle of quantum mechan- 
ics, in a linear combination of the two. In the 
beautiful experiments of De Greve et al. and 
Gao et al., it is the electron’s spin that becomes 
entangled with the polarization and the colour 
(frequency) ofa photon, respectively. Polariza- 
tion describes whether the light’s electric field 
oscillates horizontally, vertically, in a circle or 
in any combination of these. 

To generate the photon, both groups made 
use of the fact that quantum dots are optically 
active. First, the authors prepared the system 
using a technique called optical pumping, 
which amounts to depleting the spin-down state 
with a laser and pumping the spin into the up 
state. They then used a laser pulse of appropri- 
ate frequency and duration to excite the system 
to a higher-energy state and allowed it to ‘relax’ 
back to either one of the two lower-energy 
states. This kind of relaxation is accompanied 
by the spontaneous emission of a photon. With 
this process, the entire electron—photon sys- 
tem is in a superposition of two states: one in 
which the electron has spin-up orientation and 
the photon is blue and vertically polarized, and 
the other in which the electron has a spin-down 
orientation and the photon is red and horizon- 
tally polarized. The state of the emitted photon 
is completely quantum correlated (entangled) 
with the final electron spin state. 

One problem with this kind of final state is 
that both the colour and polarization of the 
photon are correlated with the electron spin. 
For entanglement to be used in applications, 
only one of the two should be correlated. There- 
fore, both groups eliminated the unwanted cor- 
relation with the additional photon property. 
De Greve and colleagues downconverted the 
photon (that is, reduced its frequency by half), 
thereby broadening its frequency range. This 
step was crucial to obtaining the desired fre- 
quency state of the photon, because the correla- 
tion between electron spin and photon colour 


was erased as a result of having one, broadened 
frequency value instead of two, distinct, sharp 
ones. The remaining quantum entanglement 
was then solely between the electron spin and 
the photon polarization. The downconversion 
has the additional benefit of providing a photon 
that is at the wavelength used in telecommu- 
nications, in which lossless photon propaga- 
tion in optical fibres can be extremely long. 
As a result, this experiment is an important 
first step towards achieving practical quantum 
networks’ based on quantum-dot electron 
spins and photons. 

Meanwhile, Gao and colleagues opted to 
use frequency as their entangled quantity, 
and so had to remove the correlation with 
polarization. They achieved this by ‘filter- 
ing’ the photon, regardless of its frequency, 
so that it acquired an anticlockwise, circular 
polarization. This eliminates the correlation 
with polarization, leaving an entangled state 
of photon colour and electron spin. Their 
filtering also solved the practical problem of 
separating the emitted photon from the vast 
number of photons emitted by the laser, which 
had a clockwise, circular polarization. 

Using quantum dots as photon emitters has 
potential advantages over competing systems. 
Quantum dots have a higher electric dipole 
moment, leading to faster photon emission. 
What’s more, they can be placed in optical 
microcavities to increase photon yield’. One 
avenue for future research would be the addition 
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ofa second, distant quantum dot to the authors’ 
set-up, with joint measurement of the two emit- 
ted photons (one from the first quantum dot 
and the other from the second) to entangle the 
spins of the two dots’. Another possibility would 
be to ‘re-pump’ a single quantum dot to extract a 
second photon, which under certain conditions 
will be entangled with the first. This idea could 
be extended” to produce multi-photon entan- 
gled states for ‘measurement-based’ quantum 
computing”. The present experiments pave 
the way to such demonstrations, and contrib- 
ute to efforts to achieve large-scale, practical 
processing of quantum information. m 
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NO 


caught in traffic 


Nitric oxide is a vital signalling molecule that controls blood flow and pressure. 
Unexpectedly, a redox switch in the protein haemoglobin o within endothelial 
cells regulates this molecule’s diffusion in blood vessels. SEE LETTER P.473 


MARK T. GLADWIN 
& DANIEL B. KIM-SHAPIRO 


itric oxide is a vasodilator. It is 

produced in endothelial cells, which 

line blood vessels, and mediates 
signalling cascades in adjoining smooth- 
muscle cells to affect the regulation of blood 
pressure, blood flow and oxygen delivery. 
Nitric oxide is proposed to control these cru- 
cial physiological processes through simple 
unregulated diffusion from its site of produc- 
tion to its target sites. But on page 473 of this 
issue, Straub and colleagues provide evidence’ 
that the oxidation state of the protein haemo- 
globin a, which is expressed at the junction 
between endothelial and muscle cells, regulates 
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nitric oxide diffusion and signalling*. 
Haemoglobin is best known for mediating 
oxygen delivery by erythrocytes (red blood 
cells). Nonetheless, low concentrations of 
this protein are expressed in other cells, such 
as human lung epithelial cells”. In addition, 
over the past 15 years several primordial glob- 
ins— including neuroglobin and cytoglobin 
— have been discovered that are expressed in 
various non-erythrocytic organs such as the 
brain, retina, endocrine organs and vascu- 
lar smooth muscle. Functions that are being 
explored for these proteins include mediat- 
ing electron-transfer reactions (such as the 
reduction of the enzyme cytochrome c and 


*This article and the paper under discussion? were 
published online on 31 October 2012. 


nitrite (NO, ) reduction) and nitric oxide 
(NO) scavenging reactions**. For instance, 
flavohaemoglobins and myoglobin have been 
proposed” to limit NO signalling by convert- 
ing it to nitrate (NO, ) in a dioxygenation 
reaction. A role for haemoglobin in NO- 
scavenging reactions is appealing on theoreti- 
cal grounds because these reactions are fast 
and can occur at the low globin concentrations 
found in cells. 

The unique diffusion properties of 
NO (it forms a spherical concentration 
gradient around its source) create chal- 
lenges for directional and compartmen- 
tal signalling by this molecule. To reach its 
signalling target, NO must escape reactions 
that convert it to inert species. 

Several solutions have evolved. These 
include spatial localization and control of NO 
production within cell-membrane structures 
called caveoli, and within hot spots of meta- 
bolic enzymes, where NO is coupled to its 
target. Alternatively, Straub et al. propose that 
the thick walls of the internal elastic lamina 
(the outermost elastic tissue of blood vessels 
that separates endothelium and smooth- 
muscle cells) create physical distance between 
the source of NO production (the enzyme 
endothelial NO synthase) and the NO target, 
the smooth-muscle soluble guanylyl cyclase 
enzyme. 

At regular points between the endothelium 
and smooth muscle, known as myoendothe- 
lial junctions (MEJs), the cells project through 
the internal elastic lamina and ‘kiss’ each other, 
increasing the NO concentration at these cor- 
ridors. These points of cell-cell contact con- 
tain gap-junction channels that allow electrical 
coupling and intercellular diffusion of vasodi- 
latory factors, such as eicosanoids, potassium 
ions and hydrogen peroxide, and small solutes 
that can dynamically regulate blood-vessel 
diameter®. Straub et al. propose that these junc- 
tions also create corridors within the internal 
elastic lamina for NO diffusion. 

The authors find that haemoglobin a is 
concentrated at the MEJs. Here, this protein 
serves to block NO diffusion to smooth mus- 
cle, specifically through the extremely fast and 
irreversible dioxygenation reaction of NO with 
ferrous (Fe**) oxyhaemoglobin to form nitrate 
and methaemoglobin, in which the iron in the 
haem group of haemoglobin is in the ferric 
(Fe**) state”* . 

The team also reports that the enzyme 
cytochrome b5 reductase 3 — also called met- 
haemoglobin reductase — forms a complex 
with cellular haemoglobin a and regulates 
NO diffusion by reducing the Fe**-haem to 
the oxygen-binding Fe**-haem. This provides 
an enzymatic mechanism to control NO dif- 
fusion (Fig. 1), because only the Fe**-haem 
can scavenge NO. By contrast, the Fe**-haem 
does not bind NO tightly and so allows its dif- 
fusion through the MEJ to the smooth muscle, 
where it activates soluble guanylyl cyclase and 
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Figure 1 | Regulation of nitric oxide (NO) diffusion by cytochrome b5 reductase 3. Endothelial cells 

of blood vessels connect to their neighbouring muscle cells through myoendothelial junctions (MEJs), 
which span internal elastic lamina. a, Straub et al.' show that when cytochrome b5 reductase 3 is inactive, 
the abundant haemoglobin a at the MEJs is in the ferric (Fe’*) methaemoglobin state, which does not 

bind tightly to the vasodilator NO. Consequently, NO passes to muscle cells where it mediates signalling 
cascades that result in vasodilation through activation of soluble guanylyl cyclase. b, Active cytochrome 

b5 reductase 3 forms a complex with methaemoglobin a, reducing it to the Fe” state. The latter blocks NO 
diffusion to smooth muscle, through a dioxygenation reaction that results in the formation of nitrate (NO; ). 


mediates the downstream signalling cascade. 

Another outcome of NO interaction with 
methaemoglobin is reductive nitrosylation, 
which can form the signalling molecules 
nitrite or S-nitrosothiols’. Both nitrite and 
S-nitrosothiols can regulate NO signalling 
independently of soluble guanylyl cyclase and 
by post-translational modification of target 
proteins. However, the reductive nitrosyla- 
tion reaction is some 200 times slower than 
the scavenging reactions’. Nonetheless, the 
Fe**-methaemoglobin allows NO to diffuse 
through the MEJ or to react and form the freely 
diffusible nitrite and S-nitrosothiols. 

Straub and colleagues’ work complements 
other studies*’””' that were conducted under 
conditions of hypoxia (oxygen shortage) and 
which found that deoxygenated globins can 
function as nitrite reductase enzymes that 
react with nitrite to generate NO. For instance, 
when oxygen levels are low in smooth-muscle 
cells, myoglobin can reduce nitrite to NO, con- 
tributing to vasodilation". In this reaction, the 
Fe**-deoxymyoglobin transfers an electron 
to nitrite to form NO and metmyoglobin. It 
seems, therefore, that globins can limit NO 
signalling when reduced and oxygenated and 
enhance NO signalling when oxidized and 
deoxygenated (Fig. 1). 

This paper highlights a novel function of 
the MEJ as an NO diffusion corridor. The 
expression and localization of haemoglobin a 
chains and cytochrome b5 reductase 3 at the 
ME) constitute a specific checkpoint or traf- 
fic light for redox-regulated NO diffusion 
at these corridors. Future work must clarify 


the post-translational modifications of cyto- 
chrome b5 reductase 3 that control its activity, 
and which would ideally couple activation of 
NO synthase to NO diffusion and ME]J ‘gate 
opening. Moreover, haemoglobin functions 
beyond NO scavenging must be explored, not 
least because of the widespread expression of 
cellular haemoglobins in plants, which do not 
possess NO synthase enzymes. = 
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bnormal metabolism is at the heart of some 
A serious health problems (such as obesity, 

diabetes and cancer), which not only reduce 
our life expectancy, but are also a great cost to society. 
This Insight offers a snapshot of the molecular 
mechanisms that underlie metabolism and its 
associated pathology, and showcases the progress 
made in this buoyant area of research. 

Metabolism beats to a drum of about 24 hours. In his 
Review, Joseph Bass shows how breakthroughs in our 
understanding of circadian rhythms and molecular 
clocks bring insight to the molecular pathogenesis of 
metabolic disorders. 

Although direct actions of hormones, such as 
insulin, regulate nutrient handling in peripheral 
tissues, the central nervous system also plays a 
significant part in metabolic regulation. From specific 
signalling mechanisms to hypothalamic circuitry, 
Martin Myers and David Olson discuss how the brain 
controls metabolism. 

Cancer cells thrive by switching to a different 
metabolic program. The mechanistic basis for these 
changes and how they are connected to oncogenic 
pathways is becoming increasingly understood. 

Almut Schulze and Adrian Harris discuss these 
advances, and explore strategies that interfere with these 
metabolic pathways for use as anticancer therapies. 

Much of our understanding of mitochondria has 
come from studying rare mitochondrial disorders. 
Scott Vafai and Vamsi Mootha review what we 
have learned from these diseases by putting them 
in the context of a contemporary understanding of 
mitochondrial evolution, biochemistry and genetics. 

Finally, Jeremy Nicholson and colleagues discuss 
how metabolic phenotyping, which involves a 
comprehensive analysis of biological fluids or tissue 
samples, can facilitate the biochemical classification 
of individual physiological or pathological states. This 
approach is currently being used in clinical practice to 
assist with disease diagnosis, prognosis and treatment 
selection for individual patients and to estimate 
disease risks at the population level. 


Joshua Finkelstein, Noah Gray, 
Marie Thérése Heemels, Barbara Marte & Deepa Nath 
Senior Editors 
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Circadian topology of metabolism 


Joseph Bass'* 


Biological clocks are genetically encoded oscillators that allow organisms to anticipate changes in the light-dark envi- 
ronment that are tied to the rotation of Earth. Clocks enhance fitness and growth in prokaryotes, and they are expressed 
throughout the central nervous system and peripheral tissues of multicelled organisms in which they influence sleep, 
arousal, feeding and metabolism. Biological clocks capture the imagination because of their tie to geophysical time, and 
tools are now in hand to analyse their function in health and disease at the cellular and molecular level. 


on the supposition that sleep is inevitable at night and waking 

should correspond to sunrise. However, some people are ‘larks’ 
and wake early, whereas others are ‘night owls’ and stay up late, hint- 
ing that there is a biological driver of sleep-wake rhythms. A triumph 
of modern genetics has been the identification of the molecular path- 
ways that dictate the sleep-wake cycle and other 24-hour-circadian 
(derived from circa diem, about a day) rhythms (Fig. 1). Many of 
the principles of this system originally came from genetic studies of 
model-organism mutants with altered period phenotypes, and are 
described in this Review in the context of our current understanding 
of clock systems in mammalian tissues. These advances in under- 
standing can also be considered in the context of human studies that 
suggest that artificial light, night work, a reduction in normal sleep 
time, shift work, travel and temporal disorganization, all of which 
are common in industrialized societies, have disrupted the pattern 
of alignment between the external light-dark cycle and the internal 
clock, which was set to a 24-hour day early in evolution (Fig. 2). 
Identification of the molecular clock may lead to insight into cir- 
cadian and sleep disorders in humans. This Review highlights how 
advances in the field of molecular clocks could help in understanding 
the molecular pathogenesis of metabolic disorders across the lifespan. 


B enjamin Franklin’s dictum “early to bed, early to rise” is built 


Origins of circadian clocks 

Consciousness of the temporal world has been a hallmark of civiliza- 
tion from prehistoric times — reflected in the iconic solar worship site 
at Stonehenge. Yet, the correspondence between biological and geo- 
physical phenomena was not recognized until the early 1700s, when 
the French astronomer Jean-Jacques d’Ortous De Mairan demon- 
strated that the leaves of Mimosa pudica continue to open and close 
every 24 hours even when the plant was enclosed in a sealed box. Since 
then, other photosynthetic organisms, and nearly all forms of life on 
the surface of the planet, have been shown to exhibit similar circadian 
cycles. The principal criteria of circadian oscillators emerged from the 
work of Pittendrigh and Aschoff, who established the defining charac- 
teristics of biological clocks: a persistent and sustained period length 
under constant conditions, entrainment to environmental signals such 
as light, and stability across wide variations in temperature (referred to 
as temperature compensation). 


The transcriptional motif 

A leap forward in our understanding of molecular clocks came from the 
deliberate mutagenesis studies of Konopka and Benzer in the fruitfly 
Drosophila melanogaster. These studies screened for 24-hour rhythms 


in the fly’s emergence from the pupal case, and provided a gateway to the 
modern era of circadian genetics’. A key advance in circadian genetics 
was the concept that clocks comprise a transcription autoregulatory 
feedback loop, with the forward limb encoding activators that promote 
transcription ofa set of repressors, which feed back to inhibit expression 
and function — a cycle that repeats itself every 24 hours across divergent 
phyla’? (Fig. 1). Indeed, transcriptional oscillators may have provided a 
selective advantage early in evolution by averting the DNA-damaging 
effects of sunlight. The presence of a photolyase domain in clock repres- 
sors indicates that the timing systems co-evolved with DNA repair’. 


A metabolic variation on transcription 

Although the transcription feedback loop represents a conserved 
model of the circadian oscillator, recent advances point towards meta- 
bolic oscillators as an additional mechanism of circadian timekeeping 
that can be, in certain circumstances, independent of transcription. A 
remarkable series of test-tube experiments”® has provided the most 
convincing evidence for a protein-based clock. A complex of just three 
proteins (KaiA, KaiB and KaiC) together with ATP undergo a self-sus- 
taining 24-hour cycle of alternating phosphorylation and dephospho- 
rylation. The phosphorylation cycle is both coupled to and directs gene 
transcription, although it can also be seen in the absence of transcription 
under certain circumstances’. The cycling of this kinase-phosphatase 
reaction remains constant at different temperatures — a defining feature 
of circadian oscillators® (Fig. 3). The turnover of ATP modulates the 
Kai phosphorylation cycle, suggesting that the clock may be coupled to 
metabolic activity’. An interdependence between circadian and meta- 
bolic oscillators has also been suggested by showing that the activity of 
clock transcription factors is sensitive to redox state® (Fig. 3). 

More recently, periodic flux in metabolic cycles has been related to 
production of reactive oxygen species (ROS). Peroxiredoxin is a redox- 
sensitive protein, which has a reactive thiol within the active site that 
is involved in electron transfer from reactive oxygen. These proteins 
exhibit 24-hour oscillation in cells in the absence of transcription — as 
shown in erythrocytes’ — and under conditions when transcription 
is arrested in the alga Ostreococcus tauri'*. Peroxiredoxin proteins 
have been shown to exhibit self-sustained oscillation in many spe- 
cies of archaebacteria, plants and in other eukaryotic cells, suggesting 
that a redox cycle may be one of the most conserved 24-hour oscil- 
lators'' (Fig. 3). Determining whether the peroxiredoxin proteins are 
themselves clocks or reporters of a more primary metabolic oscillator 
will require further genetic and biochemical analyses. 

Given that eukaryotes first diverged from bacteria around 
1.5 billion years ago, a provocative speculation is that oxygenation 
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of the atmosphere conferred an adaptive advantage on organisms 
with a redox-based clock. If this speculation is true, then organisms 
existing in anaerobic conditions, such as deep-sea vaults, might have 
less pressure to eliminate ROS, and may be devoid of a metabolic 
clock. In mammals, oscillations in the peroxiredoxin redox state 
have been proposed to represent a means of rhythmically anticipat- 
ing the generation of ROS. Conceivably, organisms that are capable 
of efficiently extinguishing ROS may have had a survival advantage 
during the oxygen expansion of the atmosphere. If this is true, then 
the peroxiredoxin clock may keep time at the organismal level by 
regulating the oscillation of ROS. Consideration of the origins of 
internal clocks remains central to our understanding of links between 
circadian and metabolic systems (Fig. 3): was it escape from sunlight, 
avoidance of toxic metabolites during respiration or elimination of 
inefficient metabolic cycles that drove these processes together? Iden- 
tifying metabolic circadian clocks could determine whether there 
is a monophyletic origin of organisms that possess such oscillators, 
emphasizing the fundamental clues that clocks may yield for under- 
standing evolutionary and functional relationships among species. 


Clocks impact on fitness in multicellular organisms 

A principle to emerge from genetic studies is that period length 
is genetically programmed. But how and why are the body’s inter- 
nal clocks set to 24 hours? Resonance studies are perhaps the best 
approach to address this question because they discriminate between 
oscillator-dependent and independent gene functions. Phenotypes 
such as growth and reproduction are monitored under conditions 
in which alignment between internal period length and the envi- 
ronmental light cycle are systematically varied (long- and short- 
period mutants are maintained under lengthened or shortened light 
cycles). In bacteria and plants, such studies suggest an advantage 
to period alignment”. But, what are the consequences of misalign- 
ment at the molecular level? One possibility is that misalignment 
reduces genome stability by shuffling the phase relationship between 
cycles of DNA damage and repair. Alternatively, misalignment may 
superimpose incompatible biochemical processes, such as the oxi- 
dative and reductive phases of the metabolic cycle. For instance, in 
yeast (Saccharomyces cerevisiae), the mutation rate increases when 
metabolic cycles are misaligned with DNA replication’*. Further- 
more, although circadian cycles are directly photosensitive in plants 
and even in the cells of D. melanogaster, in mammals the role of 
cryptochromes as both a timekeeper and a DNA-damage-repair 
agent is uncoupled. It is tempting to speculate that the coupling of 
DNA repair to circadian cycles may contribute to ageing even in 
higher eukaryotes. 


Neural-clock sensory circuit and circadian-system ageing 
Understanding how complex organisms detect light and synchronize 
the clocks in brain and other tissues to the environment remains a 
central challenge in circadian research. Sensory pathways within the 
brain have been identified that synchronize the clock independently of 
visual image formation. The light-response pathway also synchronizes 
neural and peripheral clocks, but the integrity of the clock network 
may decline with age. 


Light-response pathway in the retina 

A breakthrough in understanding the entrainment of oscillators in 
animals came with the discovery that even mice without the classic 
rod and cone visual photoreceptors are still able to synchronize their 
internal clocks to light. The light photoreceptors are expressed in 
a small number of retinal cells that express the photopigment mel- 
anopsin™. Mice that are genetically depleted of these few hundred 
cells still have normal vision, but they are unable to synchronize 
their clocks to light’. All connections to the hypothalamic pace- 
maker neurons in the suprachiasmatic nucleus go through these 
few melanopsin cells'*. Suprachiasmatic-nucleus neurons comprise 


REVIEW 


A 


M ar oscillation| 


Protein 


Light 
(Energy harvesting, 
DNA damage) 


Dark 
(Energy storage, 
DNA repair) 


Behavioural Physiological rhythms. Organismal 
rhythms cycles 

Brain (non-autonomous) 

* SCN pacemaker + Seasonal 

+ Extra-SCN . i 
Sian Wale xtra-SC| ‘ aes 
Fast Feed Periphery (autonomous) regeneration 

+ Liver (gluconeogenesis) + Epigenetic 

* Pancreas (insulin secretion) changes 


* Adipose (leptin secretion) 
* Immune cells (cytokine production) 


Figure 1 | Circadian adaptation as a unifying model that integrates 
behaviour and physiology. The circadian clock allows light-sensitive 
organisms to synchronize their daily molecular oscillations, behavioural 
rhythms, physiological rhythms and organismal cycles with the rotation 

of Earth on its axis. Core molecular pathways dictate behavioural and 
physiological cycles. This core molecular clock in mammals, expressed 
both in brain and peripheral metabolic tissues, comprises a series 

of transcription-translation feedback loops that include opposing 
transcriptional activators (CLOCK-BMAL1) and repressors (PER-CRY)’. 
The non-phosphorylated PER-CRY complex represses CLOCK-BMALI; 
phosphorylation, in turn, results in the degradation of PER-CRY and 

the turnover of these repressors. In addition, CLOCK-BMALI] induces 
transcription of REV-ERB and of ROR, which regulate BMAL1 expression. 
During the night, PER-CRY is degraded through the ubiquitylation of CRY 
by FBXL3. The circadian clock coordinates anabolic and catabolic processes 
in peripheral tissues with the daily behavioural cycles of sleep-wake and 
fasting-feeding. SCN, suprachiasmatic nucleus. 


the central node in the clock network, which in turn synchronizes 
the hypothalamic control of energy balance, sympathetic outflow 
and the neuroendocrine systems”. A future goal will be to extend 
studies of the circadian neurosensory circuit to understand whether 
factors that affect sensory perception or nutrition in early develop- 
ment might alter synaptology within this circuit. 
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Figure 2| Affect of ageing and environmental disruption on circadian 
control of metabolic processes. The circadian clock partitions metabolic 
processes within the peripheral tissues according to whether we are asleep 
or awake; for example, the pancreatic clock promotes insulin secretion 
during the wake-feeding period”, but the adipose tissue clock promotes fat 
accumulation during the sleep as well as the wake period. Synchronization 
of peripheral tissue clocks and downstream metabolic processes with the 
environmental cycle is crucial for the maintenance of the health of the 
organism***’. We are only just beginning to gain an appreciation of how 
both ageing” and environmental disruption (including changes in diet, 
time of feeding or jet lag) perturb the integration of the circadian and 
metabolic networks’”’. CNS, central nervous system. 


Hypothalamic networks link circadian and energetic centres 
The next question is how does the hypothalamic clock communicate 
with extra-pacemaker and peripheral tissues to produce a coherent 
phase in the circadian systems throughout the organism? An apprecia- 
tion of the central role of the suprachiasmatic nucleus in coordinat- 
ing sleep-wake behaviour, as well as physiological systems, pre-dated 
molecular advances in defining the molecular mechanisms of the 
clock. Master pacemaker cells were first localized to the anterior hypo- 
thalamus"®. Transplantation of the suprachiasmatic nucleus with tissue 
from short-period mutants into animals with damage to the anterior 
hypothalamus imposed short periods on the host, proving that the 
suprachiasmatic nucleus is a master clock, not just a relay station”. 
These studies also suggested that secreted factors from the suprachias- 
matic nucleus contribute to the synchronization of clocks, although the 
identity of these synchronizing factors has been elusive. 

Among the areas receiving suprachiasmatic-nucleus projections, 
a large output is toward the dorsal medial hypothalamic (DMH) 
area”. This circuit has been implicated in a phenomenon known 
as food anticipatory activity (FAA), whereby animals increase their 
activity in response to food provided during the incorrect circadian 
phase, although it is possible that other hypothalamic regions may 
be necessary for FAA”". Moreover, the role of clock genes in FAA is 
controversial. Animals deficient in MC3R, a metabolic signalling 
receptor, have diminished FAA”, suggesting that metabolic rather 
than clock factors may be the principal driver of FAA. 
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In addition to output to the DMH area, suprachiasmatic-nucleus pro- 
jections synapse on orexin-expressing neurons within the lateral hypo- 
thalamic area. Orexin (also known as hypocretin) is a neuropeptide that 
has been found to stimulate arousal and increase energy expenditure 
with intracerebroventricular administration. Deficiency of either orexin 
or its receptor is a hallmark of both canine genetic” and human autoim- 
mune forms of narcolepsy. Interestingly, narcolepsy is also correlated 
with elevated body-mass index™. Furthermore, ablation of orexin recep- 
tors increased susceptibility to diet-induced obesity, suggesting that the 
physiological role of the orexins is to promote arousal and antagonize 
weight gain”. 


Ageing and the circadian system 

Given the extensive integration of the neuroendocrine and circadian 
systems, it is intriguing to note that suprachiasmatic-nucleus function 
declines with age” (Fig. 2). What molecular mechanisms might account 
for the observed effects of ageing on the integrity of circadian systems? 
Studies in circadian-clock-mutant animals have shown the susceptibil- 
ity of certain tissues to damage, such as accelerated cataract formation 
and dermatitis”. In addition, Bmal1-knockout mice have a premature 
death, compared with control mice, that is correlated with increased 
accumulation of ROS”. Deficiency of cryptochrome, a repressor of the 
internal clock repressor, has also been associated with alterations in 
liver regeneration, emphasizing the coupling of circadian and cell-cycle 
pathways”. Although epidemiological evidence suggests there is a link 
between circadian disruption and cancer risk, a full understanding of the 
role of circadian systems in tumorigenesis remains an area for investiga- 
tion. Given the emerging link between metabolic flux and cancer-cell 
survival, it will be especially interesting to determine whether circadian 
systems also intersect with oncogenic pathways through regulation of 
fuel selection (discussed later). 


Circadian origins of metabolic disease 

An emerging theme in both circadian and metabolic studies is that it 
is not only the central nervous system, but also the peripheral tissues 
that modulate sensory response to the environment. A key goal is to 
determine how the circadian cycle modulates homeostatic and nutrient 
responsive pathways to coordinate behaviour and energetics. 


Feeding resets peripheral clocks 

Shifting food availability to the incorrect circadian time (through 
restricted feeding) in animals entrained to a standard light-dark sched- 
ule causes a shift in the peripheral clock in the liver without altering pace- 
maker neuron clocks or overall behavioural rhythm”. One pathway that 
entrains the liver clock to feeding involves glucocorticoid signalling”. 
Temperature also entrains the clock through mechanisms involving tran- 
scriptional control of heat-shock factor protein 1 (refs 32 and 33) (Fig. 3). 
ADP-ribosylation also affects entrainment of the liver to feeding™. 
Together, these findings suggest that multiple levels of transcriptional 
regulation reinforce signalling from the neuronal pacemaker to the liver, 
with the net result of suprachiasmatic-nucleus input restraining the 
response of circulating hormonal or metabolite signals that are related 
to feeding. 


Meal timing affects metabolic syndrome 

Mounting evidence suggests that alignment between central behavioural 
rhythms and feeding time is important in metabolic health. Animals 
fed a high-fat diet shift their pattern of food intake and consume nearly 
all of the excess calories at the incorrect circadian time (during the rest 
period)”. A similar erosion of the partitioning of feeding between activ- 
ity and rest is observed in Clock-mutant* and Npas2-knockout” mice, 
and a high-fat diet induces period lengthening in wild-type mice, which 
is a core property of the clock”. Similarly, mice fed a high-fat diet exclu- 
sively during the rest period have accelerated weight gain compared 
with animals fed during the correct circadian time”, whereas restricting 
access to high-fat food ameliorates diet-induced metabolic syndrome in 
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Figure 3 | Cross-talk between clock transcription and metabolic systems at 
the molecular and physiological levels. Cycling of circadian transcriptional 
activators and repressors” controls fundamental physiological cellular and 
metabolic processes, including gluconeogenesis, oxidative phosphorylation, 
RNA processing and translation, and vesicle trafficking”. This can occur 


mice’’. Even constant exposure to light causes increased insulin resist- 
ance’’. An important goal will be to elucidate the neural and molecular 
basis of the links between altered timing and behavioural and metabolic 
disruption. Conceivably, alterations in the time of feeding may induce 
desynchrony between suprachiasmatic-nucleus firing rhythms and 
input from peripheral feeding responsive signals. Altered excitability 
and activity of certain cell groups may occur within restricted windows 
as a result of either cell-autonomous clock function or non-autonomous 
input from the suprachiasmatic nucleus. Although much of the experi- 
mental evidence that links timing to metabolism has emerged in rodent 
studies, both epidemiological and clinical investigations suggest parallel 
mechanisms may predispose humans to metabolic pathologies”. 


Function of clock genes in metabolic and vascular disease 
Genetic tools to perturb the internal clock have created opportunities to 
analyse the molecular basis of the clustering of certain pathologies within 
limited time windows, including morning myocardial infarction and 
hypertensive crises” (Fig. 2). The basis of increased risk of myocardial 
infarction in the morning is probably multifactorial; circadian tran- 
scription factors have been shown to trans activate the promoter of the 
pro-thrombotic cytokine plasminogen activator inhibitor type 1 (PAI-1) 
through both REV-ERB and E-box motifs, corresponding to oscillation 
in thrombosis risk“. Arrhythmogenesis also occurs more frequently 
in the morning, and is associated with the control of potassium-channel 
expression by myocardial clock genes”. Cardiac clocks also influence 
myocardial contractility and oxidative metabolism”. Both autonomous 
and non-autonomous vascular effects of the clock cause variation in 
blood pressure across the light-dark cycle“. CLOCK expression in the 
vasculature affects the progression of atherosclerosis in xenograft mod- 
els”. Similarly, dyslipidaemia arises in circadian mutants” and may be 
related to dysregulation of core clock genes” and the clock-controlled 
gene Ccrn4l (ref. 51). 


Phase and tissue clocks determine physiological outcomes 

Although the aforementioned studies have analysed mutations in 
multiple tissues, one complexity of the circadian system is that the 
set of clock transcriptional activators and repressors exerts opposing 
effects within different tissues at different times in the 24-hour cycle. 
This is most clearly evident in studies of glucose metabolism. For 
example, Clock-mutant mice are hyperglycaemic and have increased 
susceptibility to diet-induced obesity early in life, but also become 
hypoinsulinaemic with age**’. An explanation for this finding was 
provided by studies in which animals with selective ablation of the 


Temperature cycles 
AMPK 


in various tissue types, but there may be tissue-specific differences. In turn, 
metabolic cycles reciprocally affect the clock: the NAD* biosynthesis*"”, 
peroxiredoxin"' and kinase® cycles generate active intermediates that, along with 
cycles of temperature, provide feedback to regulate the core clock transcriptional 
network. AMPK, AMP-activated protein kinase; ROS, reactive oxygen species. 


circadian gene Bmal1 within the endocrine pancreas were shown 
to have much more profound hyperglycaemia and f-cell failure 
than the multitissue mutants”. Indeed, whole body loss of Bmall 
increases insulin sensitivity” 3 and selective Bmal1 ablation in the liver 
causes hypoglycaemia™. Taken together, these findings suggest that 
CLOCK-BMALI exerts opposing effects in the liver and the pancreas 
— within the liver, it promotes fuel mobilization during fasting, but 
in the pancreas this complex promotes post-prandial insulin exocy- 
tosis. The loss of the CLOCK-BMALI] repressor CRY results in the 
opposite phenotype in the liver, with increased gluconeogenesis and 
enhanced responsiveness to both glucagon and glucocorticoids”. 

One challenge to studies of genetically modified circadian-mutant 
mice is whether the observed abnormalities are dependent on alterations 
in the core timing process or in downstream processes independent of 
the internal clock. Furthermore, because ablation of individual clock 
factors causes compensatory changes in the expression of other factors 
within the core loop — even when studying effects of individual gene 
manipulations — both loss and gain of function analyses become impor- 
tant to determine whether the effects are direct or indirect”. Studies 
in both activator and repressor mutants, and analyses across different 
circadian phases in synchronized tissues, will ultimately elucidate the 
link between gene function, timing and physiology. 


Coupling of circadian and metabolic cycles 

Extensive cross-talk between metabolic and circadian systems allows 
organisms not only to anticipate physiological needs in advance of the 
daily light-dark cycle, but also to adjust the phase of internal cycles in 
response to changes in the environment. The cross-talk between cel- 
lular oscillators and metabolic systems can be traced to overlapping 
transcriptional networks that encode circadian and nuclear-receptor 
signalling pathways (Fig. 4). 


Transcription factors respond to metabolic flux 

At the molecular level, multiple sequence alignment of core clock genes 
originally demonstrated the presence of the Per-Arnt-Sim (PAS) 
domain in the internal clock transcription factors. This conserved 
motif is also found within xenobiotic and hypoxaemic response tran- 
scription factors in addition to kinases and ion channels. PAS domains 
participate in both protein-protein interactions and direct detection 
of small molecules. Haem, a gas-responsive cofactor, binds to the PAS 
domain in a subset of these proteins and detects both carbon monoxide 
in the clock factor NPAS2 and oxygen in the bacterial PAS-containing 
histidine kinase**” (Fig. 4). Haem has also been found to bind to the 
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Figure 4 | Genomic and epigenetic links between circadian and metabolic 
systems. Research has highlighted the existence of a highly dynamic and 
multi-layered network of factors involved in epigenetic transitions across 

the circadian cycle. For example, AMP-activated protein kinase (AMPK)- 
mediated phosphorylation of CRY controls proteolytic degradation of 

the negative arm of the central oscillator. Distinct from the ‘core’ oscillation 
shown in Fig. 1, which uses CKI to phosphorylate and therefore cause the 
degradation of CRY, AMPK entrains the circadian clock to the metabolic 
environment by using the same modification as CKI. The histone demethylase 
JARID 1a (ref. 89) is involved in the activity of CLOCK-BMAL] complexes 
(which regulate PER and CRY transcription) that are bound to E-box motifs. 
This activates the positive arm of the central oscillator by blocking HDAC1 
activity, a function that may be tied to the use of a-ketoglutarate and iron, 

to hydroxylate and remove H3K4 methyl marks on histones. However, this 
demethylation activity seems to be dispensable in circadian regulation. 
NAD*-dependent enzyme SIRT1 activity is dependent on NAD*-regulating 
nicotinamide mononucleotide phosphoribosyltransferase (NAMPT) (a 
transcriptional product of the CLOCK-BMALI complex). SIRT1 in turn feeds 
back to inhibit the CLOCK-BMALI complex”. Deacetylation by SIRT1 
generates nicotinamide (NAM) and O-acetyl-ADP-ribose (Ac-ADPr) as 
by-products. The REV-ERB~-HDAC3-NCORI repressive complex is sensitive 
to haem levels, possibly activating the repressor activity®’”, and binds directly 
to the promoters of clock-controlled genes (RRE), while binding directly to 
CLOCK-BMALI deacetylating nearby histones (not shown). Transcriptional 
activation and repression of glucocorticoid-responsive elements (GREs) by the 
glucocorticoid receptor (GR) are modulated by glucocorticoids and CRY”. 
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PAS domain of the nuclear hormone receptor co-repressor proteins 
REV-ERB-a and REV-ERB-f (refs 60-62). 

Although the hallmark of the circadian cycle is its self-sustained 
activity, changes in circadian gene transcription in response to ener- 
getic flux have also been demonstrated. One example of a chemical 
signal that couples internal clock function to nutrient state involves 
AMP kinase (AMPK), a protein that is activated following intracellular 
nutrient restriction. Stimulation of AMPK leads to the phosphorylation 
and subsequent proteasomal degradation of the repressor CRY. An 
important question to ask is whether alterations in AMPK signalling 
modulate the output of pacemaker neurons to control behavioural and 
physiological rhythms, or whether AMPK signalling affects circadian 
transcriptional cycles in a restricted number of peripheral tissues. 


Nuclear receptors connect endocrine and circadian physiology 
Core clock genes can be defined as those that are necessary for cir- 
cadian behavioural rhythmicity; however, it is intriguing to note that 
data mining suggests that a large fraction of the transcriptome exhibits 
circadian oscillation in tissues such as the liver, although the specific 
number of transcripts designated as oscillating depends on the thresh- 
old set for gene amplitude during bioinformatic extraction™. Indeed, 
many of the nuclear hormone receptors in mice exhibit circadian 
oscillation and, in turn, the timing of interaction between nuclear 
receptors and ligands may be considered a broad feature of the cou- 
pling between temporal and physiological systems”. The coupling of 
circadian and nuclear hormone receptor networks is reflected in the 
overlap between genome occupancy of the orphan nuclear receptor 
ERR-a and of BMAL]I (ref. 66), and in altered circadian function in 
mice that are deficient in the co-activator PGC-1a (ref. 67). Surpris- 
ingly, the circadian repressor PER2 binds to several nuclear hormone 
receptors, whereas CRY binds to the glucocorticoid receptor and 
modulates its function”’. Glucocorticoid has also been shown to mod- 
ulate the expression of REV-ERB-a (ref. 69), a regulator of BMAL1 
(also known as ARNTL) transcription”’. REV-ERB-a agonists have 
been shown to influence circadian function and may protect against 
adverse metabolic consequences of diet-induced obesity’’. Genetic 
studies in which REV-ERB is eliminated only in adults have reinforced 
the link between REV-ERB-a and its homologue REV-ERB-6 in the 
co-regulation of circadian and metabolic homeostasis”. 

Genomic analyses have also begun to increase the evidence 
for co-repressors in coupling circadian and metabolic systems 
(Fig. 4). HDAC3, a class I histone deacetylase that is recruited to 
the REV-ERB-a-NCORI transcriptional complex”, exhibits diurnal 
occupancy of the loci that encode the genes involved in lipogen- 
esis and carbohydrate metabolism”. REV-ERB-a and REV-ERB-f 
occupy a spectrum of lipogenic genes in addition to core clock 
genes”, although it remains unclear whether REV-ERB exerts the 
same effect on both classes of transcripts. For instance, BMAL1 
occupies loci, encoding both internal clock and metabolic genes 
throughout the genome”’. However, analysis of expressed transcripts 
indicates closer phase alignment between genome occupancy and 
RNA transcription for core clock genes. Moreover, although clock 
genes occupy many sites throughout the genome, the core clock 
genes generally exhibit greater variation in expression across the 
cycle compared with output genes. One exception is the very high 
amplitude rhythm of DBP and TEF, two output genes belonging 
to the proline and acidic amino-acid-rich basic leucine zipper 
(PAR-bZIP) family of transcription factors. 

Although genomic analyses bring greater focus to studies of tran- 
scriptional cross-talk, future transcriptome analyses will be necessary 
to determine whether occupancy of loci throughout the genome is 
accompanied by changes in expression. Eventually, it will be important 
to extend chromatin immunoprecipitation and sequencing analyses to 
comparison of the transcriptome in wild-type and knockout mice for the 
factors in question to assess function. For instance, although steatosis is 
observed with hepatic ablation of both HDAC3 (ref. 74) and REV-ERB-a 
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dynamics, with energy minima across the circadian landscape achieved 
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(refs 72 and 73), delineating the relationship between circadian gene tran- 
scription changes and pathologies remains a challenge. The coincidence 
of circadian and metabolic perturbation in these mice may create a vicious 
cycle and augment the adverse effects of dual disruption in timing systems 
and metabolism. 


Epigenetics and circadian cycles 

Clocks also synchronize to the environment through post-transla- 
tional modification of transcription factors and histones that tune 
gene expression rhythms to metabolic state (Fig. 5). NAD" oscillation, 
redox flux, ATP availability and mitochondrial function influence 
acetylation and methylation reactions, and may be important factors 
in circadian synchrony. 


Chromatin transitions impact on core clock gene cycling 

In liver tissue, the rhythmic acetylation of histone 3 corresponds with 
CLOCK-BMAL1-mediated transcriptional activation of Per] and Per2 
genes, and to recruitment of the histone acetyltransferase p300 — an 
event closely coupled to RNA polymerase II binding”. In an addi- 
tional twist, CLOCK itself participates in histone acetylation”*. Feed- 
back repression by CRY proteins abrogates histone acetylation, which, 
together with rhythmic CLOCK-BMALI binding to DNA, probably 
contributes to circadian oscillation in gene expression”’. Further inves- 
tigation will be necessary to determine how the rhythmic assembly of 
activator and repressor complexes influences the kinetics of transcrip- 
tional oscillation, and to delineate how metabolic signals modulate 
dynamic transitions in the epigenetic state. 


Histone deacetylases couple metabolic and circadian cycles 

Studies of histone deacetylase activity have demonstrated mechanistic 
integration between circadian and metabolic processes at the level of 
post-translational protein modification and gene transcription (Fig. 4). 
As already noted, REV-ERB functions together with the co-repressor 
NCOR1 to rhythmically recruit the class I histone deacetylase HDAC3, 
and genetic abrogation of the NCOR1-HDACS3 interaction results in 
both metabolic pathologies and circadian disruption”. In addition, 
interactions occur between CLOCK-BMALI] and the class III histone 
deacetylases belonging to the sirtuin (SIRT, silencer of information 
regulator) superfamily of chromatin-modifying enzymes””*’. SIRT1 
association with the clock activator complex represents an additional 


Phase 1-hour 


shift 


whereas circadian insults lead to off-synchrony pathways in which phase 
and amplitude are misaligned. Such misaligned states may be permanent 
(analogous to kinetic trapping of misfolded polypeptide) or re-aligned. 

I propose a concept of ‘chronostasis’ to describe the circadian synchrony 
landscape (just as proteostasis describes the folding landscape), with both 
cis- and trans-acting factors affecting achievement of energy minima and 
determining trajectory across the topological map. 


mode of autoregulation because CLOCK-BMALI] directly modulates 
the turnover of cellular NAD‘, a cofactor for deacetylase reactions*”*. 
NAD* functions as an electron shuttle in oxidoreductase reactions and 
as a cofactor in deacetylase and ADP-ribosylation modifications, rais- 
ing intriguing questions concerning the role of NAD* in bidirectional 
interactions between circadian and metabolic signalling. Furthermore, 
the NAD*-dependent sirtuins have been established as regulators of 
metabolic pathways in response to calorie restriction and as modula- 
tors of oxidative damage and DNA-repair processes that are central to 
lifespan regulation. As such, the nexus of circadian control of NAD* and 
sirtuin activity may have broad implications for ageing and oxidative 
metabolism, which are particularly relevant in view of the association 
between period length and longevity”. 


Histone methylation in metabolism and circadian rhythms 

In addition to identification of histone acetylation as a key event 
in circadian cycles, biochemical pull down with PER1 has dem- 
onstrated that the clock repressor complex is also associated with 
factors involved in histone methylation, including WDRS (ref. 84), 
whereas the mammalian methyltransferase EZH2 has been shown 
to participate in CRY-mediated repression® (Fig. 4). Conversely, 
rhythmic recruitment of the histone methyltransferase MLL1 par- 
ticipates in gene activation by CLOCK-BMALI (ref. 86). Rhythmic 
trimethylation of histone 3 lysine 4 (H3K4me3) is also involved in 
circadian activation of the clock-controlling gene DBP, whereas 
dimethylation of histone 3 lysine 9 (H3K9mez2) corresponds to its 
repression’’. The link between histone methylation and circadian 
cycles has also been identified in studies of the Arabidopsis thaliana 
clock**. CLOCK-BMAL1 activity has also been shown to involve 
cyclic association of the histone demethylase JARID 1a (ref. 89), 
although a direct link to methylation was not observed. Activity of 
the jumonji C domain demethylases, such as JARID 1a, is coupled to 
cellular redox and mitochondrial energetics because both Fe I and 
a-ketoglutarate are used as cofactors”. Whether changes in cellular 
metabolism may affect circadian systems through alterations in DNA 
methylation remains untested. 

A range of additional mechanisms have been implicated in trans- 
duction of metabolic flux to co-activators and co-repressors, although 
the effect these may have on circadian gene transcription is not yet 
known (Fig. 4). For instance, CTBP1, a co-repressor associated with 
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NCORI, is sensitive to the NADH:NAD* ratio, and CTBP1 itself is 
an NAD* reductase”, making this a potential candidate in the com- 
munication between metabolic changes and circadian oscillations. 


Chronobiology and health and management of disease 
The integrative physiology of circadian and metabolic systems has 
emerged through a combination of biochemical and experimental 
genetic studies; however, emerging approaches in human analyses pro- 
vide an important avenue for future work (Fig. 5). Monogenic disorders 
in sleep onset and waking have provided evidence that clock genes have 
an effect not only on subjective chronotype, but also on neurological 
pathways that regulate sleep in humans. Animal models of these coding 
mutations in humans may be a platform with which to investigate links 
between neuroendocrine homeostasis and circadian systems”. 

One of the more surprising results in human genetic analyses has 
been the association between variants of both CRY2 and MTNR1B 
genes with glucose levels in humans”. Although the pharmacology 
of the melatonin receptor 1B in glucose homeostasis is complex, studies 
in rodents support a role for melatonin signalling in rhythms of insulin 
secretion”’. Studies of Smith-Magenis syndrome, a haploinsufficiency 
disorder localized to RAII and characterized by neural-behavioural 
abnormalities, intellectual deficit, obesity and circadian sleep disruption, 
has provided further evidence for a genetic link between CLOCK gene 
expression and energetic disorders”. Emerging human genetic work par- 
allels evidence from animal-based experimental studies to strengthen the 
hypothesis that genetic signatures of circadian function may be used to 
predict risk for metabolic disorders in humans. 

Although beyond the scope of this Review, both longitudinal pop- 
ulation studies and clinical investigations have indicated there is an 
association between shift work and metabolic disease. For instance, a 
study of nurses, who are one of the best monitored cohorts with a large 
representation of individuals who work shifts, has associated sleep time 
and circadian disruption with a broad range of disorders — including 
type 2 diabetes, gastrointestinal disorders and cancer — that may also be 
modulated by circadian genotype”’. Moreover, sleep loss and circadian 
disruption may be interacting risk factors for developing type 2 diabetes 
in individuals who are predisposed to the disease”. The public health 
implications may be quite broad given the frequency of circadian behav- 
ioural disruption; indeed, the habit of altering bedtime on weekends, 
or ‘social jet lag; has been associated with increased body weight’. 
Laboratory models also suggest there is a direct causal role of circadian 
disruption on glucose tolerance”, although separating the effects of 
circadian disruption from sleep reduction as a result of experimental 
regimens remains a challenge. Ultimately, a combination of clinical, 
genetic and animal paradigms will be needed to understand the links 
between circadian biology and metabolism and to tailor preventive 
interventions and therapies for humans. 


Future horizons and implications of time 

Summer 2012 marked a four-year cycle when we celebrated break- 
ing physical boundaries at the Olympic Games; we also saw NASA's 
Voyager 1 spacecraft reach the farthest distance a man-made object has 
journeyed from our planet. Despite the marvel of progress that these 
images conjure, in our realistic moments we are reminded of the prime- 
val constraints of our simple origins on the surface of Earth, none more 
fundamental than the daily alternation of light and darkness. Indeed, 
recognition of this environmental pressure has marked thinking about 
evolution ever since Darwinian times, despite the abstraction of space 
travel. What is the meaning of this pervasive timescale? 

In green plants, circadian cycles represent an innovation to the dom- 
inant constraint of time. Namely, sessile plants use clocks to defend 
against DNA damage during exposure to sunlight, while optimiz- 
ing oxygenic photosynthesis. In animals, the circadian system also 
provides flexibility in response to environmental challenge, but the 
solutions involve adaptations within both the nervous system and 
peripheral tissues. Nonetheless, some of the same toolkit has been 
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deployed — including the PAS domain module, and the coupling of 
transcriptional oscillators to metabolic outputs. Epigenetic programs 
also fine-tune the clock across prokaryote and eukaryote lineages. 

In the past century, we have also witnessed the invention of electric 
light, television, the jet engine and the Internet. But we are still unable 
to escape from the limits of inner time. 

Although many gaps exist in our understanding, there is compel- 
ling evidence that points towards environmental disruptors of timing 
as agents of metabolic dysregulation (Fig. 5). Although abrogation 
through genetic mutation of the clock pathway is unlikely to explain 
common disease, the technology is now available to test how rare vari- 
ants affect disorders such as type 2 diabetes, obesity and associated 
cardiometabolic complications. Clocks may inform stratification of 
risk of dyslipidaemia, microalbuminuria, retinopathy, neuropathy and 
cardiomyopathy. The discovery of circadian variants that affect glucose 
homeostasis, gluconeogenesis and B-cell function raises the possibility 
that pharmacological modification of molecular clock function may 
have therapeutic benefits. Given the window that circadian systems 
provide to understanding the partitioning and flux of fuel across dif- 
ferent phases of the fasting-feeding cycle, it is likely that insight into the 
clock system may also provide an understanding of metabolic fate. This 
may include processes such as cellular regeneration and proliferation, 
and the switch from quiescent to active states in haematopoietic tissues. 
Although we are fixed in time by the clock in our genes, it is reasonable 
to predict that drugs, and even nutraceutical interventions, may soon 
be in hand to selectively alter time — even within specific tissues and 
cells — as a means of improving robustness, adaptability and health. m 
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Central nervous system 
control of metabolism 


Martin G. Myers Jr’? & David P. Olson* 


Although it is a widely held thought that direct hormone action on peripheral tissues is sufficient to mediate the control 
of nutrient handling, the role of the central nervous system in certain aspects of metabolism has long been recog- 
nized. Furthermore, recent findings have suggested a more general role for the central nervous system in metabolic 
control, and have revealed the importance of a number of cues and hypothalamic circuits. The brain’s contributions 
to metabolic control are more readily revealed and play a crucial part in catabolic states or in hormone deficiencies 


that mimic starvation. 


he survival of multicellular organisms depends on the appropri- 
ate uptake and release of nutrients by major metabolic tissues. In 
the absence of continuous feeding, the availability of metabolic 
fuels (for example, glucose, fatty acids and amino acids) for use in tis- 
sues is maintained by storing nutrients, which are later released at the 
appropriate time and rate. Hormones that are secreted by the pancreatic 
islets of Langerhans modulate important aspects of nutrient uptake and 
storage (insulin) or their release into the circulation (glucagon). They do 
this partly by acting directly on the tissues that are the main reservoirs 
for these nutrients (for example, liver, adipose and muscle tissue). 
Although the direct actions of these hormones on metabolic tis- 
sues are crucial for whole-body metabolic homeostasis, several central 
nervous system (CNS)-regulated systems have long been recognized 
to control important aspects of metabolism. For instance, the elabora- 
tion of catecholamines by the sympathetic nervous system (SNS), along 
with hypothalamically controlled hormones (such as glucocorticoids 
and thyroid hormone), functions in concert with glucagon to mediate 
the counter-regulatory response, promoting nutrient release into the 
blood and favouring substrate use over storage’. Well-recognized CNS 
pathways mediate crucial aspects of this counter-regulatory response. 
Recent data reveal a more general role for CNS pathways in the modu- 
lation of metabolism: various nutrient, energetic and hormonal cues 
(such as insulin and the adipose-derived hormone leptin) function in 
the hypothalamus to control glucose and lipid metabolism, in addi- 
tion to overall energy balance”’ (Fig. 1). Several hypothalamic areas, 
especially the ventromedial nucleus and arcuate nucleus (including the 
arcuate melanocortin system), as well as the hindbrain, make important 
contributions to these effects. Although the peripheral response to the 
hormones that control metabolism (for example, insulin and glucagon) 
tends to obscure these CNS contributions under many conditions, the 
brain controls overall metabolic tone and is crucial when the peripheral 
systems cannot compensate — especially when pancreatic hormone 
output is absent or cannot be modulated. 


Established autonomic and neuroendocrine roles 

As anyone who has examined parameters of glucose homeostasis in 
mammals knows, stresses (such as pain or restraint) increase blood 
sugar. This reflects the activation of the SNS and the hypothalamic-—pitu- 
itary—adrenal axis, which promote the breakdown of macromolecular 


forms of stored energy (such as glycogen, triglyceride and protein) for 
release into the bloodstream (as glucose, fatty acids and glycerol, and 
amino acids). Furthermore, SNS-mediated catecholamine action in the 
islets promotes increased glucagon release while suppressing insulin 
secretion; the parasympathetic nervous system (PNS) mediates oppos- 
ing actions’. Therefore, stress signals, which are coordinated by the 
CNS, modulate autonomic and neuroendocrine function to increase 
blood glucose. Furthermore, this stress-induced hyperglycaemia com- 
plicates the treatment of diabetes by promoting poor glycaemic control”. 

CNS-mediated modulation of glucose homeostasis is not unique to 
anxiogenic stresses, but it is also a crucial response to physiological 
stresses, such as exercise or infection, in which increased circulating 
nutrient availability is required to meet the needs of muscle contraction 
or the immune response, respectively®. Indeed, metabolic cues can also 
promote this response: hypoglycaemia — a common manifestation of 
iatrogenic insulin overdose — not only promotes glucagon secretion 
over insulin production at the level of the islets, but also acts directly 
in the brain to increase SNS activity and glucocorticoid production (as 
well as food intake), directing the return to normoglycaemia’ *. Within 
the brain, specialized glucose-sensing neurons that lie in several sites in 
the brainstem (including the nucleus tractus solitarius and other parts 
of the dorsal vagal complex) and hypothalamus (such as the ventrome- 
dial nucleus) respond to the decreased availability of glucose (the main 
metabolic fuel for the brain under most conditions) to promote this 
counter-regulatory response’. The importance of this system is under- 
scored by the consequences of its failure to respond appropriately to 
hypoglycaemia in individuals, following repeated exposure to hypo- 
glycaemia as a consequence of intensive insulin therapy. Such impair- 
ment increases the severity and frequency of hypoglycaemic events, and 
diminishes the patient's ability to effectively recognize their occurrence. 


A larger role for the CNS in metabolic control 

In addition to the well-established role for the brain in mediating 
important aspects of the counter-regulatory response to hypogly- 
caemia and other stresses, results over the past 15 years have dem- 
onstrated a more general role for the CNS in the sensing and control 
of whole-body metabolic homeostasis”"®. This theme has emerged 
from the search for signals that communicate the status of adipos- 
ity or energy stores to the brain to control food intake and energy 
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Figure 1 | Hypothalamic pathways in the control of metabolism. Recent 
findings have implicated medial basal hypothalamic nuclei (the arcuate 
and ventromedial nucleus) in the control of metabolism. Leptin, secreted 
from adipose tissue as an indicator of long-term energy stores, and insulin, 
reflective of recent food intake in addition to adipose stores, function in 
these nuclei to modulate the autonomic nervous system to increase hepatic 
glucose production and peripheral glucose disposal. Other signals of energy 
surfeit, including glucose and fatty acids, act in concert with these signals, 
whereas signals of energy deficit (such as hypoglycaemia and activation of 
AMP-dependent protein kinase (AMPK) and ghrelin) act in the opposite 
manner. Although the ventromedial nucleus neural pathways that mediate 
the responses to these stimuli have not been molecularly characterized, 
the arcuate nucleus melanocortin system (which is composed of pro- 
opiomelanocortin (POMC) neurons that act in part through melanocortin-3 
receptor (MC3R) and melanocortin-4 receptor (MC4R), and is antagonized 
by Agouti-related protein (AgRP)-producing neurons) contributes. 


expenditure. Insulin — the first hormone to be studied in this regard 
— exhibits elevated baseline blood concentrations with increasing 
adiposity, and increases rapidly in response to feeding. Acute admin- 
istration of insulin directly into the CNS tends to suppress feeding”. 

The cloning of the gene that encodes leptin, which is secreted by 
adipose tissue in approximate proportion to body fat stores, was a water- 
shed event in our understanding of the role of the CNS in the control of 
energy balance and metabolism". A lack of leptin or its predominantly 
CNS-expressed receptor, LRb, not only promotes massive hyperphagia 
and decreased energy expenditure (with subsequent obesity), but also 
results in early-onset insulin resistance, hyperglycaemia and metabolic 
dysfunction, which are, at least in rodents, disproportionately severe 
in relation to that expected with obesity alone. Indeed, whole-body or 
CNS-restricted leptin treatment of leptin-deficient ob/ob mice rapidly 
restores glycaemic control independently of changes in food intake 
or adiposity, suggesting that leptin acts in the brain to control blood 
glucose levels independently of energy balance’”’’. These observations 
provoked a careful examination of the roles for CNS insulin and leptin 
action in the control of whole-body metabolism (mainly the control 
of glucose production and disposal)'*”*. These studies reported that 
CNS insulin action contributes to the suppression of hepatic glucose 
output, and CNS leptin action promotes increased hepatic glucose 
flux by increasing gluconeogenesis while diminishing glycogenolysis. 
In diet-induced obese rodents, leptin mainly suppresses glycogenoly- 
sis to decrease net hepatic glucose production (HGP)*”*. Ina similar 
vein, adiponectin and glucagon-like peptide 1 (GLP-1), like insulin, 
suppress gluconeogenesis and HGP following intracerebroventricular 
administration (although GLP-1-producing CNS neurons, rather than 
the gut, could represent the physiological source of CNS GLP-1 in this 
case)'”"®. By contrast, intracerebroventricular ghrelin (which opposes 
leptin action in most cases) increases HGP, as does the adipose-derived 
hormone resistin (which attenuates insulin action)’*”’. 

Subsequent genetic studies have demonstrated crucial roles for CNS 
leptin and insulin signals (including those mediated by elements of 
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the insulin receptor substrate (IRS) 2 phosphatidylinositol (PI)-3-OH 
kinase-phosphoinositide-dependent kinase 1 (Pdk1) pathway that sup- 
press the transcription factor FOXO1) in the control of glucose homeo- 
stasis”’”* (Fig. 2). Furthermore, impairing leptin and insulin action in 
the same CNS circuits provokes substantially greater metabolic dysfunc- 
tion than that observed with ablation of either one alone, suggesting a 
synergy between the insulin and leptin signalling pathways in the CNS 
control of metabolism” (Fig. 2). 

In addition to hormones, nutrients and signals of cellular energy 
status also have a role in the CNS control of metabolism. As might 
be predicted from the response to hypoglycaemia, pathways that 
monitor cellular energy status in the hypothalamus also modulate 
food intake and whole-body metabolism. Activation of hypotha- 
lamic AMP-dependent protein kinase (AMPK) (which is stimulated 
by the depletion of cellular ATP levels) increases food intake and 
circulating glucose concentrations”. Roles in the CNS for amino 
acids, their sensing by the mammalian target of rapamycin pathway 
and mitochondria-derived reactive oxygen species in the control of 
peripheral metabolism have not yet been examined. However, these 
cellular signalling systems contribute to the control of food intake 
and energy homeostasis”*”’, and could also participate in the CNS 
control of metabolism. 

In addition to the response to hypoglycaemia, glucose sensing in 
the hypothalamus also controls peripheral glucose handling: intracer- 
ebroventricular glucose administration decreases HGP through a mech- 
anism that requires cellular ATP generation and the subsequent closure 
of ATP-sensitive potassium (K*-ATP) channels’. Similarly, although 
fatty acids are not typically thought of as a major fuel for the CNS, in 
the hypothalamus fatty acids and the systems that mediate their mito- 
chondrial import and oxidation control food intake and blood glucose 
levels”*. Intracerebroventricular injection of oleic acid (and, to a lesser 
extent, other fatty acids) suppresses HGP, as do a variety of pharmaco- 
logical and short-hairpin-RNA-mediated manipulations designed to 
alter hypothalamic fatty-acid oxidation and increase cellular concentra- 
tions of fatty-acid-coenzyme A (CoA). These studies suggest there is an 
important role for fatty-acid metabolism, and potentially for fatty-acid- 
CoA as an intracellular mediator, in nutrient sensing by hypothalamic 
neurons that modulate hepatic glucose handling. 

Although most data that link the CNS to peripheral metabolism 
have examined the control of blood glucose concentrations as their 
surrogate for metabolism, a number of studies also suggest that 
these pathways may similarly modulate lipid handling throughout 
the body. Intracerebroventricular injection of leptin, for instance, 
suppresses lipogenesis in adipose tissue”. Moreover, genetic or phar- 
macological manipulation of CNS melanocortin action can alter sys- 
temic lipid metabolism”. 


CNS circuits and cellular signals in metabolic control 

Brain-wide and circuit-specific genetic manipulation of intracellular 
mediators of leptin or insulin action have demonstrated important 
roles for these signals in the CNS for the control of metabolism. 
Although genetic tools that manipulate many components of the 
cellular nutrient-sensing apparatus are not available or practi- 
cal (because of factors such as redundant isoforms or embryonic 
lethality), direct injection of pharmacological inhibitors or short- 
hairpin RNA reagents to target these pathways has suggested that the 
hypothalamus, especially the medial basal hypothalamus (MBH) — 
which includes the ventromedial and arcuate nuclei — is an impor- 
tant site of action’. Other hypothalamic regions (such as the lateral 
hypothalamic area) and the brainstem contain neural populations 
that sense leptin and nutrients, including glucose and amino acids, 
but roles for these areas in the control of metabolism by direct leptin 
or insulin action have not been reported’”'. However, the dorsal 
vagal complex (including the nucleus tractus solitarius) partici- 
pates in the sensing of hypoglycaemia and the counter-regulatory 
response, and represents a potential contributor to overall metabolic 
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control. The dorsal motor nucleus of the vagus controls outputs to 
the periphery through the vagus nerve, and it is well-positioned to 
regulate metabolism. 

The heterogeneity of brain tissue — in which even small and cir- 
cumscribed brain areas (such as the arcuate nucleus) contain multiple 
types of neurons with distinct, or even opposing, functions — means 
that understanding the specific cell types and the circuits that are 
involved in metabolic control requires the use of cell-specific genetic 
tools*””*. Although it is difficult to analyse the cellular pathways that 
control fatty-acid-CoA levels and sensing of cellular energy using 
neuron-specific techniques, many aspects of insulin and leptin action 
can be investigated with the tools that are available. For instance, given 
the important and synergistic roles that insulin and leptin have in the 
CNS control of metabolism, insulin signalling in LRb neurons might 
be an important mechanism to analyse. The redundancy of signalling 
by insulin and insulin-like growth factor 1 (IGF-1) receptors renders 
it difficult to analyse through the simple deletion of insulin recep- 
tors from LRb neurons (indeed, the metabolic phenotype of insulin- 
receptor deletion throughout the brain is rather modest). However, 
deletion of IRS2 — which is the crucial shared second messenger 
protein for insulin and IGF-1 receptors — from the entire brain or 
specifically from LRb neurons results in early insulin resistance and 
glucose intolerance. 

Additionally, roles for direct LRb signalling in the control of energy 
homeostasis have been examined in mouse models, in which LRb 
mutants that are defective for specific signalling pathways replace 
the endogenous LRb. This analysis revealed crucial roles for LRb-sig- 
nal transducers and activators of transcription 3 (STAT3) signalling 
in the control of energy balance, although the mice demonstrated 
improvements in glucose homeostasis relative to mice that were 
completely null for LRb****. However, the improvements in glycae- 
mic control may be secondary to increased overall glucose disposal 
because HGP remains high in animals that carry mutations that block 
LRb-STAT3 signalling. Another pathway, potentially mediated by 
the LRb-associated tyrosine kinase Jak2, or other LRb signals that are 
distinct from phosphorylation sites on LRb, also seems to contribute 
to the control of glucose homeostasis by leptin’”””*. 
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Figure 2 | Cellular signalling pathways 
modulated by leptin and insulin. The insulin 
receptor (InsR) and the related insulin-like 
growth factor 1 receptor (IGF-1R) act through 
their intrinsic tyrosine kinases to promote 

the phosphorylation of receptor tyrosine 
residues (pY), leading to the recruitment 

and phosphorylation of the insulin receptor 
substrate (IRS) proteins-1 and -2. These recruit 
phosphatidylinositol 3-kinase (PI3K), which 
activates phosphoinositide-dependent kinase 1 
(PDK1) and Akt to promote, among other things, 
the phosphorylation and nuclear exclusion of 
the FOXO1 transcription factor, inactivating 
FOXO1-mediated transcription. Phosphorylated 
IRS1 or IRS2 also recruit GRB2, which promotes 
extracellular-signal-regulated kinase (ERK) 
activation. Stimulation of leptin receptor (LRb) 
activates the associated Jak2 tyrosine kinase to 
promote the phosphorylation of intracellular 
tyrosine residues on LRb. One of these residues 
recruits the suppressor of cytokine signalling 3 
(SOCS3) and the protein-tyrosine phosphatase 
SHP-2. SHP-2 recruits GRB2 to activate ERK 
signalling. Two additional phosphorylated LRb 
residues recruit latent signal transducers and 
activators of transcription (STAT3 and STATS), 
which then translocate to the nucleus to modulate 
gene transcription. LRb also acts through 
undefined pathways to modestly promote PI3K 
pathway signalling. 


Cell 
membrane 


Thus, specific leptin- and insulin-mediated signals in LRb neurons 
contribute to glucose homeostasis. LRb neurons may be an important 
locus at which cellular nutrient and energy sensing could also have a 
notable metabolic role, although this has yet to be tested. 


A role for the ventromedial nucleus 

Within the MBH, the ventromedial nucleus is an obvious place to begin 
examining the hypothalamic circuits that control overall metabolism, 
owing to this region’s important role in sensing glucose levels and in 
aspects of the counter-regulatory response”. Manipulation of AMPK 
activity in the ventromedial nucleus by stereotactic injection of inhibi- 
tors or activators suggests a role for AMPK signalling in this nucleus in 
sensing low glucose concentrations and mounting the counter-regula- 
tory response to hypoglycaemia. A role for the ventromedial nucleus 
in glucose metabolism has also been implied by the manipulation of 
glucokinase and the K*-ATP channel by stereotactic injection of phar- 
macological agents or short-hairpin RNA reagents into this nucleus’. 

Leptin activates at least some neurons in the ventromedial nucleus, 
which contains a substantial group of LRb neurons”. This region also 
seems to have a role in the control of glucose disposal by leptin. The 
injection of leptin into the ventromedial nucleus, but not the adjacent 
arcuate nucleus, increases sympathetic output and glucose uptake 
by muscle and brown adipose tissue (both of which have heavy SNS 
innervation)**. The deletion of LRb specifically from the ventromedial 
nucleus in mice has shown a role for ventromedial-nucleus leptin 
action in the control of food intake and overall energy expenditure”. 
Although glucose metabolism in these animals has not been completely 
characterized, they are less glucose tolerant at early ages than control 
mice — before the onset of frank obesity — consistent with the idea that 
leptin action in the ventromedial nucleus promotes glucose disposal 
independently of the control of energy balance. 

Insulin action in the ventromedial nucleus also contributes to the 
control of metabolism. Injection of insulin into this area promotes 
peripheral glucose disposal, whereas knockdown of insulin receptors 
in the ventromedial nucleus of adult mice causes glucose intoler- 
ance”. Similarly, enhancing the action of insulin in the ventromedial 
nucleus by interference of the insulin-inhibited FOXO1 pathway 
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promotes increased glucose disposal in muscle tissue’. 

Although local hypoglycaemia (or AMPK activation) in the ventro- 
medial-nucleus promotes a counter-regulatory response and glucose 
production, ventromedial-nucleus leptin and insulin signals promote 
glucose uptake by metabolically active tissues. Thus, whereas hypo- 
glycaemia, and leptin or insulin in the ventromedial nucleus promote 
SNS-mediated processes, the specific outcome differs between stimuli. 
This discrepancy could reflect differences in the effects of each stimulus 
on the same cell type, or it might indicate that distinct (ventromedial 
nucleus) cell types respond to each stimulus. Unfortunately, systems to 
molecularly define and genetically perturb subtypes of ventromedial- 
nucleus-restricted neurons still need to be developed. 


Arcuate nucleus and hypothalamic melanocortin system 
Neural systems in the arcuate nucleus are thought to have a crucial 
role in the control of glucose homeostasis. Ablation of all arcuate- 
nucleus neurons with monosodium glutamate produces not only 
obesity but also frank hyperglycaemia, suggesting that the net tone 
of the arcuate nucleus tends to suppress hepatic glucose output”. By 
contrast, ventromedial hypothalamus lesions produce obesity, but 
also impair hypoglycaemia sensing. Of note, however, is that lesion- 
ing studies reveal nothing about the roles of specific populations of 
neurons within these nuclei, but only reveal the aggregate output for 
the entire area. 

Arcuate-nucleus-directed LRb expression in rats and mice that are 
null for LRb alters body weight and food intake only modestly, but 
markedly ameliorates hyperglycaemia in these animals’. Similarly, 
adenoviral augmentation of insulin signalling pathways in the arcuate 
nucleus also improves glucose homeostasis in LRb-null animals™. 
Thus, leptin and insulin-like signalling in the arcuate nucleus are 
important for the suppression of glucose production. 

Our understanding of the molecular make-up of neurons in the 
arcuate nucleus has advanced markedly over the past 15 years**”. 
This advance has permitted the development of a substantial array 
of molecular genetic tools with which to probe the function of spe- 
cific sets of arcuate-nucleus neurons. Although the arcuate nucleus 
contains other cell types, two main opposing types of output neu- 
rons have been defined: those that express pro-opiomelanocortin 
and those that contain Agouti-related protein (AgRP) and neuropep- 
tide- Y. In the arcuate nucleus, pro-opiomelanocortin is processed to 
produce a-melanocyte stimulating factor (a- MSH) — an agonist for 
the melanocortin-3 receptor (MC3R) and melanocortin-4 receptor 
(MC4R) — which collectively promote activity, energy expenditure 
and suppress food intake. AgRP is an inverse agonist for MC3R and 
MCAR, opposing central melanocortin action, and neuropeptide-Y is 
an inhibitory neuropeptide that suppresses energy use and promotes 
food intake. AgRP neurons also contain and release the inhibitory 
neurotransmitter GABA (y-aminobutyric acid) as do most of the 
other non-pro-opiomelanocortin neurons in the arcuate nucleus”. 
Insulin and leptin both inhibit AgRP neurons, and insulin action on 
these neurons suppresses HGP*”™. 

Pro-opiomelanocortin neurons and their roles in controlling glu- 
cose homeostasis are more complicated. Recent data suggest there are 
multiple subtypes of pro-opiomelanocortin neurons, including those 
that respond to leptin or serotonin (each of which activates distinct 
subtypes of pro-opiomelanocortin neurons) and those that are inhib- 
ited by insulin’’. Additional arcuate-nucleus neurons that transiently 
express pro-opiomelanocortin during development, but that later 
switch their fate to non-pro-opiomelanocortin-expressing cells have 
also been identified”. Pharmacological studies suggest that melano- 
cortin action increases hepatic glucose output, as does insulin action 
on pro-opiomelanocortin neurons in a background null for insulin 
receptors outside the liver’*”*. By contrast, expression of LRb in pro- 
opiomelanocortin neurons in an otherwise LRb-null background 
suppresses HGP, whereas simultaneous deletion of LRb and insulin 
receptors from pro-opiomelanocortin neurons increases HGP”, 
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Impairing PI(3)K action in all pro-opiomelanocortin neurons also 
promotes insulin resistance”. Thus, although it is clear that the action 
of insulin and leptin on pro-opiomelanocortin neurons — along with 
the action of melanocortin in general — modulates HGP, the exact role 
of this pathway remains unclear. Presumably, the exact genetic, hor- 
monal and physiological milieu of the animal affects the response to 
melanocortin activation. Although the relevant variables remain to be 
defined, the indirect regulation of pro-opiomelanocortin neurons by 
leptin action on other hypothalamic neurons could bea contributor”. 

Genetic or pharmacological manipulation of central melanocortin 
action also alters systemic lipid metabolism in rodents”’. Specifically, 
inhibition of melanocortin action promotes lipid uptake, triglyceride 
synthesis and accumulation of fat in white adipose tissue; central mel- 
anocortin blockade also increases circulating high-density lipoprotein 
cholesterol®*. Importantly, these effects are independent of food intake 
and body weight. Central melanocortin stimulation results in fat-depot- 
specific increases in phosphorylated perilipin A and hormone-sensi- 
tive lipase, presumably through sympathetic outflow”. Whether these 
effects of melanocortin action will be recapitulated in humans remains 
to be determined, but these findings raise the possibility that central 
melanocortin receptors may be a therapeutic target for the lipid disor- 
ders associated with obesity and diabetes. 


Melanocortin output 

MC3R and MC4Rare the predominant melanocortin receptors in the 
CNS. MCAR is expressed fairly widely in the CNS, whereas MC3R has 
more limited expression. Both receptors are expressed in the hypo- 
thalamus: MC3R is expressed heavily in the arcuate nucleus and ven- 
tromedial nucleus, whereas MCAR is expressed particularly strongly 
in the paraventricular hypothalamic nucleus. 

Whole-body deletion of MC4R results in massive hyperphagic obe- 
sity, increased lean mass, and hyperinsulinaemia that is out of propor- 
tion to the obesity®. Much of the metabolic disturbance observed in 
Mc4r-knockout mice may be secondary to their obesity, although 
Mc4r-knockout mice display decreased energy expenditure and 
increased metabolic efficiency prior to the onset of hyperphagia”. 
Mc3r-knockout mice on the other hand do not display increased food 
intake and are more modestly obese, but they also have decreased 
lean mass and an abnormality in fuel partitioning that preferentially 
stores energy as fat®’. When challenged with a high-fat diet, male 
Mc3r-knockout mice gain weight rapidly and develop hyperinsulin- 
ism similar to that seen with Mc4r-knockout mice”. This suggests 
a potentially important role for MC3R in mediating metabolic (rather 
than energy balance) responses to melanocortin action, although the 
extent to which this reflects altered nutrient partitioning relative to 
direct control of other metabolic parameters remains unclear. The 
function of these two melanocortin receptors in energy balance does 
not overlap because deletion of both receptors results in greater obe- 
sity and metabolic dysfunction than does the loss of either receptor 
alone®. This additive effect underscores the importance of delineat- 
ing the identity and neural circuitry used by both MC3R and MC4R”. 

Unfortunately, the low expression of MC3R and MCA4R in most 
melanocortin-responsive neurons has prevented the detailed map- 
ping of the melanocortin circuitry with much cellular specificity. 
Nevertheless, pharmacological or viral manipulation of melanocortin 
action by site-specific injection in multiple areas of the brain has 
been shown to alter energy balance’'”’. The effects of such manoeu- 
vres on the metabolic actions of central melanocortins have not been 
fully explored. Genetic approaches have also been used to identify 
the specific neurons and circuits under melanocortin control that 
regulate metabolism, and these have suggested that central melano- 
cortin action is dissociable in the CNS, with discrete brain regions 
mediating specific melanocortin effects (Box 1 and Fig. 3). Whether 
the metabolic effects of melanocortins are mediated mainly through 
neuroendocrine pathways or the autonomic nervous system or both 
has yet to be definitively determined. 
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BOX1 


Genetic and pharmacological studies have begun to map the 
functional effects of central melanocortin action on specific regions 
of the brain. Pharmacological approaches using melanocortin 
agonists and antagonists have demonstrated a role for melanocortin 
action in the hindbrain and central amygdala in the regulation of 
food intake’®®°. Stereotactic injection of viral vectors that express 
melanocortin-receptor modulators has also suggested a role for 

the paraventricular nucleus, ventromedial nucleus and latera 
hypothalamic area in mediating the effects of melanocortin action on 
food intake”*. Such studies are unable to assign these physiological 
responses to a specific melanocortin receptor, because commonly 
used melanocortin agonists engage both MC3R and MCAR. Genetic 
approaches using neuron-specific Cre-loxP technologies have begun 
to address this issue®!**: selective re-expression of MC4R in an 
otherwise null background has revealed the importance of neurons 


Sites of melanocortin action 


within the paraventricular nucleus (and possibly the amygdala) in 
mediating the anorectic actions of MCA4R. The role of these neurons 

in directly controlling metabolic effects is hard to decipher given 

the profound affect that restoring MC4R on these neurons — in an 
otherwise MC4R-null background — has on the obesity phenotype. By 
contrast, selective expression of MCAR in cholinergic, preganglionic 
autonomic output neurons alters energy expenditure and hepatic- 
insulin sensitivity in the obese Mc4r-knockout background. It is 
important to note that the subcellular location of functional MCAR (cell 
body compared with axon terminal) has not been clearly established, 
and it is possible that melanocortin ligands may exert their effects on 
melanocortin receptors expressed at terminals at a distance from the 
cell body. The development of similar genetic tools for MC3R should 
permit a more detailed neuron- and receptor-specific analysis of 
central melanocortin action®. 


The brain and the physiological control of metabolism 

Although the participation of the brain in the response to hypo- 
glycaemia is well-established and has an impact on the therapy for 
patients with insulin-deficient diabetes, the more recently described 
role of sensing of hormones and nutrients in the brain for the con- 
trol of whole-body metabolism was initially met with some scep- 
ticism”’. Specifically, the roles of these brain systems have been 
defined largely at the limits of physiology — in many cases, under 
hyperinsulinaemic ‘clamp’ conditions that lock the contribution of 
the islets at a specific level, so that alterations in insulin or glucagon 


secretion cannot compensate for changes in glucose production or 
disposal. The study of these brain systems often requires such con- 
ditions because the reserve insulin and glucagon secretory capacity 
of the islets can otherwise compensate for alterations in glucose 
handling by the major metabolic organs, obscuring the effect of 
the brain on the control of glucose production and disposal. To 
what extent these brain systems are relevant is controversial. They 
do, however, have a role in leptin deficiency: not only does leptin 
ameliorate hyperglycaemia in animals’ that are genetically leptin 
deficient, but it also improves metabolic function — by decreasing 


POMC 


AgRP 


Ventromedial 
nucleus 


Arcuate 
nucleus 


| 


Feeding efficiency 
Nutrient partitioning 
Lean mass accrual 


Possible sites including: 
PVH/Amygdala 


DMV IML Unknown 
Insulin Energy Lipid 
secretion expenditure handling 


VMH, LH 
DMV/NTS 


| 


Satiety 


Figure 3 | CNS melanocortin action. The pro-opiomelanocortin (POMC) 
neurons of the arcuate nucleus produce melanocortin-receptor agonists, 
whereas Agouti-related protein (AgRP)-producing neurons antagonize 
melanocortin action. The two predominant CNS melanocortin receptors, 
MC3R and MCAR, mediate distinct effects. MC3R is expressed at high 
levels in the arcuate nucleus and ventromedial nucleus, and mainly controls 
the conversion of food to fat, nutrient partitioning and build-up of lean 
mass, whereas MC4R predominantly mediates effects on food intake. 

The specific neurons that express MC4R are unclear; however, genetic 

data suggest important roles for the paraventricular hypothalamic (PVH) 


nucleus and amygdala MC4R in these effects, whereas other data, including 
pharmacological data, suggest roles for MC4R at other sites such as the 
ventromedial (VMH) nucleus, lateral hypothalamus (LH), dorsal motor 
nucleus of the vagus (DMV) and nucleus tractus solitarius (NTS), as well as 
the lateral parabrachial nucleus. In addition to mediating metabolic effects 
attributable to body weight, MC4R mediates the control of insulin secretion 
through the dorsal motor nucleus of the vagus and energy expenditure 
through the intermediolateral cell column (IML) of the spinal cord. MC4R also 
contributes to lipid handling in the body, but the MC4R-expressing site or sites 
that mediate these effects remain undefined. 
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blood sugar and lipids — in humans and mice with lipodystrophy 
syndromes’*”*. In lipodystrophy (in which the lack of adipose tissue 
results in functional leptin deficiency), circulating insulin concen- 
trations are elevated, and exogenous insulin administration fails to 
adequately suppress hyperglycaemia and hyperlipidaemia. Thus, 
in this situation, leptin action (presumably in the brain) improves 
metabolic parameters that cannot be ameliorated by very high insu- 
lin levels. More commonly, brain leptin and insulin make important 
contributions to metabolic homeostasis in B-cell failure (for exam- 
ple, in insulin-deficient diabetes)”°’”. In rodents with insulin insuf- 
ficiency due to chemical or autoimmune 6-cell destruction, leptin 
(including intracerebroventricular leptin) normalizes blood glucose 
in the absence of exogenous insulin, and improves glycaemic control 
as an adjunct to insulin therapy. Leptin action is also likely to have 
an important role in glucoprivation, in which insulin is either fixed 
and artificially high (in insulin overdose) or fixed near zero (with 
inhibitors of glucose metabolism, such as 2-deoxyglucose). 

Thus, the need to eliminate the islet contribution to metabolic con- 
trol in order to observe the effects of the brain-hormone and nutrient- 
sensing pathways does not diminish the importance of these brain 
systems in the control of peripheral metabolic homeostasis. Instead, it 
suggests that the CNS serves to raise and lower the overall tone of the 
peripheral response, and dominates the control of metabolic homeo- 
stasis when pancreatic islets are unable to compensate. 


Questions for the future 

Although we now have a much clearer picture about how the brain 
contributes to metabolic homeostasis than we did a decade ago, many 
questions still remain. CNS perturbations have been seen to modulate 
processes such as glucose production and lipid metabolism in the 
periphery, but the circuits and mechanisms that intervene between 
the brain and systemic nutrient handling are poorly defined. Cer- 
tainly, the SNS contributes to some of the outcomes, such as in the 
promotion of glucose disposal into skeletal muscle. Similarly, vagot- 
omy — including sub-diaphragmatic vagotomy — prevents many of 
the changes in hepatic-glucose metabolism that occur in response 
to intracerebroventricular or MBH-directed injections, suggesting 
that the parasympathetic nervous system has a role*. Additionally, 
the control of glucagon secretion may contribute to the suppression 
of blood glucose by CNS leptin”’. Determining how each of these 
systems contributes to the various responses to brain nutrient and 
hormone action, and the specific brain regions and circuits that con- 
trol the response, will be important for our understanding of the 
physiology of metabolism. 

Within the brain itself, we have an enormous amount left to learn. 
In addition to resolving the mechanisms through which the leptin and 
melanocortin systems contribute to various metabolic conditions, per- 
haps the most important issue to resolve is the cellular specificity of the 
system. Clearly, some progress has been made in this regard, especially 
in the arcuate nucleus, for which some genetic reagents are available 
for use in analysis; however, we desperately need other methods of 
genetic analysis to examine defined subpopulations of ventromedial- 
nucleus and arcuate-nucleus neurons, and to clarify the roles of other 
hypothalamic and brainstem circuits that have been implicated in 
metabolic control. = 
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How cancer metabolism is tuned for 
proliferation and vulnerable to disruption 


Almut Schulze! & Adrian L. Harris” 


Cancer metabolism has received a substantial amount of interest over the past decade. The advances in analytical 
tools have, along with the rapid progress of cancer genomics, generated an increasingly complex understanding of 
metabolic reprogramming in cancer. As numerous connections between oncogenic signalling pathways and meta- 
bolic activities emerge, the importance of metabolic reprogramming in cancer is being increasingly recognized. The 
identification of metabolic weaknesses of cancer cells has been used to create strategies for treating cancer, but there 
are still challenges to be faced in bringing the drugs that target cancer metabolism to the clinic. 


characteristic alterations in their metabolic activity. These 
early studies resulted in the hypothesis that irreversible inac- 
tivation of respiration is causally involved in the development of 
tumours’. Later, increased rates of glutaminolysis and lipid synthesis 
were found in tumour tissue, and the close association between can- 
cer-cell metabolism and hypoxia was established (reviewed in ref. 2). 
Over the past decade, a more complex picture of cancer-cell 
metabolism has emerged. Many cancers show increased glucose 
uptake and enhanced glycolytic rates, suggesting that metabolic 
alteration provides a growth advantage for tumour cells’. Some of 
these changes are similar to the metabolic response of non-trans- 
formed cells to growth-promoting signals, so it is not entirely clear 
whether these metabolic alterations are specific to cancer or just 
reflect the increased proliferation of tumour cells. However, differ- 
ent oncogenic signalling pathways target distinct components of 
the metabolic network. Moreover, tumours with the same genetic 
lesions have different metabolic profiles depending on the tissue 
they arise in*, suggesting that the tissue environment strongly affects 
the metabolic activity of cancer cells. Altered metabolic activity is 
crucial for supporting uncontrolled proliferation, evasion of growth- 
inhibitory signals, cell migration and the dissemination of metastatic 
cells into distant tissues. However, metabolic reprogramming also 
renders cancer cells more susceptible to perturbations within the 
metabolic network. Identifying these metabolic dependencies could 
open a window of opportunity for therapeutic intervention. 


B y the mid-twentieth century, cancer cells were known to show 


Oncogenic signalling drives metabolic reprogramming 
Cancer cells need to generate large amounts of precursors for macro- 
molecule biosynthesis to allow the accumulation of biomass during 
cell growth and proliferation (Fig. 1). Enhanced uptake of glucose 
supports the production of intermediates for the synthesis of lipids, 
proteins and nucleic acids’. In addition, cancer cells have increased 
glutamine uptake and glutaminolysis, which replenish intermediates 
of the tricarboxylic acid (TCA) cycle that are redirected into biosyn- 
thetic reactions — a process known as anaplerosis’. 

Oncogenic signalling drives many of the same pathways that are 
responsible for the metabolic response of normal cells to growth- 
promoting signals. Activation of AKT by phosphatidylinositol- 3-OH 
kinase (PI(3)K) results in increased glucose uptake, enhanced activity 


and mitochondrial localization of hexokinase and increased glyco- 
lytic flux. The mammalian target of rapamycin complex 1 (mTORC1) 
and hypoxia-inducible factor (HIF) (discussed in more detail later) 
also contribute to the increased expression and activity of glycolytic 
enzymes’. Oncogenic levels of MYC have been linked to increased 
glutaminolysis through a coordinated transcriptional program that 
results in glutamine addiction of MYC-transformed cells’. MYC also 
promotes the alternative splicing of the pyruvate kinase gene PKM, 
resulting in enhanced expression of the embryonic isoform PKM2 
(ref. 8). PKM2 is highly expressed in rapidly proliferating tissues, 
and many cancer cells exclusively express this isoform. In contrast 
to other isoforms, PKM2 can switch from a tetrameric to a dimeric 
form with lower activity. This switch can be induced in response to 
tyrosine kinase signalling’ and allows the accumulation of glycolytic 
intermediates for biosynthetic processes. 

Tumour suppressor pathways also affect metabolism. For exam- 
ple, TP53 maintains mitochondrial activity through the expression 
of cytochrome c oxidase 2, and loss of this gene recapitulates the 
metabolic consequences of the Warburg effect’’. TP53 regulates 
glycolysis by inducing the expression of the TP53-induced glycoly- 
sis and apoptosis regulator (TIGAR), an enzyme with homology to 
fructose-2,6-bisphosphatase (ref. 11). Increased expression of this 
regulator inhibits glycolytic activity and increases the availability of 
glucose-6-phosphate (G6P) for entry into the oxidative arm of the 
pentose phosphate pathway (PPP) (Fig. 1), thereby supporting the 
production of riboses and NADPH for nucleotide biosynthesis as 
part of the DNA-damage response. However, p53 can also reduce the 
production of NADPH by inhibiting G6P dehydrogenase, the rate- 
limiting enzyme of this pathway’”. Recent evidence suggests that the 
metabolic functions of p53 may be essential for its role as a tumour 
suppressor, whereas other functions — including induction of cell- 
cycle arrest and apoptosis — are dispensable”. 


Mitochondrial metabolism 

Contrary to previously held views, most cancer cells retain func- 
tional mitochondria. Mitochondria are essential for the synthesis of 
citrate by the TCA cycle for the production of cytoplasmic acetyl- 
coenzyme A (CoA), a central source of acetyl groups for lipid syn- 
thesis and protein acetylation. A large fraction of nuclear-encoded 
mitochondrial genes are part of the transcriptional program 
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induced by MYC (ref. 2). MYC-transformed cells have been shown 
to be highly dependent on AMP-activated protein kinase (AMPK)- 
related kinase 5 (ARK5) (also known as NUAK1), which limits the 
activity of mTORCI and maintains the high respiratory capacity 
of these cells'*. However, oxidative mitochondrial metabolism can 
be impaired in cancer cells as a result of mutations in components 
of the TCA cycle or electron transport chain. Moreover, tumour 
hypoxia inhibits the entry of pyruvate into the TCA cycle and pre- 
vents the synthesis of citrate through this route. Under these condi- 
tions, reductive carboxylation of glutamine-derived a-ketoglutarate 
by the NADPH-dependent isoforms of isocitrate dehydrogenase, 
IDH1 and IDH2, is used to generate citrate for lipid synthesis’*"” 
(Fig. 1). IDH2 is predominantly located within the mitochondria, 
so mitochondrial function contributes to macromolecule biosyn- 
thesis in cancer cells. Mitochondria could also be required to restore 
cytoplasmic pools of NAD* through the malate-aspartate shuttle 
to support the high glycolytic flux of cancer cells. Therefore, mito- 
chondria can no longer be viewed as inactive bystanders but should 
be recognized as important organelles, which are actively involved 
in the transformation process by maintaining the biosynthetic 
capacity of cancer cells. 
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Bioenergetics and redox balance 

The increased biosynthetic activity of cancer cells requires not only 
enhanced uptake and conversion of nutrients, but also a corre- 
sponding increase in the production of NADPH as a reducing agent 
for anabolic reactions and to maintain cellular redox balance’®. The 
reduced enzymatic activity of PKM2 may allow the accumulation 
of glycolytic intermediates and promote the entry of G6P into the 
oxidative arm of the PPP for the production of NADPH (Fig. 1). 
Indeed, inhibition of PKM2 by direct modification in response to 
oxidative stress increases the production of NADPH and reduced 
glutathione”. Furthermore, enhanced expression of PFKFB4, an 
isoform of the bifunctional enzyme phosphofructokinase 2 (PFK2), 
is essential to balance glycolytic activity and NADPH synthesis for 
the production of anti-oxidants in prostate cancer cells’. However, 
different cancers may depend on additional pathways for NADPH 
production as KRAS expression in a pancreatic cancer model has 
been found not to increase activity of the oxidative PPP”’. Cytoplas- 
mic NADPH can also be produced by the oxidative decarboxylation 
of malate to pyruvate by malic enzyme 1 (ME1) and the conversion 
of citrate into a-ketoglutarate by IDH1. Although the exact con- 
tribution of these enzymes to NADPH production in cancer is not 
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Figure 1 | Overview of metabolic activities in cancer cells. The main 
metabolic pathways that contribute to the production of macromolecules in 
mammalian cells are nucleotide synthesis, the pentose phosphate pathway, 
serine synthesis, glutaminolysis, cholesterol synthesis, fatty-acid synthesis 
and elongation desaturation. Glycogen synthesis and pH regulation 
contribute to cellular bioenergetics. The enzymes involved in these pathways 
are shown in bold, those induced in response to hypoxia are marked with 

an asterisk. Metabolic enzymes in the TCA cycle, fumarate hydratase 

(FH) and succinate dehydrogenase (SDH), can act as tumour suppressors. 
2-hydroxyglutarate (2-HG) is produced from a-ketoglutarate (a-KG) by 

the mutant forms of isocitrate dehydrogenase 1 (IDH1) and IDH2 enzymes 
that are found in cancer (grey dashed arrow). Reductive carboxylation of 
a-KG by IDH1 and IDH2 produces citrate for lipid synthesis in hypoxic cells 
(black dashed arrow). ACC, acetyl-CoA carboxylase; ACLY, ATP citrate 
lyase; ACO, aconitase; CA9, carbonic anhydrase 9; CoA, coenzyme A; 
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CS, citrate synthase; FASN, fatty-acid synthase; F1,6BP, fructose-1,6- 
bisphosphate; F2,6BP, fructose-2,6-bisphosphate; F6P, fructose-6-phosphate; 
GLS, glutaminase; GLUD, glutamate dehydrogenase 1; GOT, glutamic- 
oxaloacetic transaminase; GSH, glutathione; G1P, glucose-1-phosphate; G6P, 
glucose-6-phosphate; G6PD, G6P dehydrogenase; HIF, hypoxia inducible 
factor; HMGCR, 3-hydroxy-3-methylglutaryl-CoA reductase; KEAP, 
kelch-like ECH-associated protein 1; LDHA, lactate dehydrogenase A; 

MCT, monocarboxylate transporters; MDH, malate dehydrogenase; 

MEI, malic enzyme 1; NHE1, Na’/H"* exchange protein 1; NRF2, nuclear 
factor (erythroid-derived 2)-like 2; PDH, pyruvate dehydrogenase; 

PDHK1, pyruvate dehydrogenase kinase; PEP, phosphoenolpyruvate; 

PFK, phosphofructokinase; PHD, prolyl hydroxylases; PHGDH, 
phosphoglycerate dehydrogenase; PKM2, pyruvate kinase M2; PL, 
phospholipids; ROS, reactive oxygen species; SCD, stearoyl-CoA desaturase; 
TAG, triacylglycerides. 
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fully understood, they may present attractive targets for selectively 
killing tumour cells with high biosynthetic activity. 


Lactate transport and pH regulation 

Enhanced uptake of glucose and its conversion into lactate creates 
the problem of intracellular acidification and lactate accumulation. 
Maintaining an alkaline intracellular milieu is essential for cancer- 
cell survival, whereas acidification of the extracellular micro-envi- 
ronment may be important to facilitate cancer-cell invasion and 
metastasis formation (reviewed in ref. 22). 

Three main acid-regulatory systems have been implicated in the 
pH regulation of cancer cells; these involve Na*/H* exchangers, car- 
bonic anhydrase 9 (CA9) and monocarboxylate transporters (Fig. 1). 
Regulation of intracellular pH by an isoform of an Na‘/H* exchanger 
protein (NHE1) is required for tumour growth, cell migration and 
metastasis formation”. CA9 is a target gene of HIF and prevents 
the acidification of cells under hypoxic conditions”. The catalytic 
domain of CA9 is located at the extracellular face of the plasma 
membrane and catalyses the conversion of membrane-permeant 
carbon dioxide into bicarbonate, thereby removing protons from 
the cell to maintain an alkaline intracellular milieu. 

Lactate transport across the plasma membrane is facilitated by 
monocarboxylate transporters and is coupled to the symport of 
protons”. Some monocarboxylate transporters require association 
with an ancillary protein (CD147, also known as basigin) for mem- 
brane localization and activity, and it has been shown that silenc- 
ing of CD 147 in cancer cells prevents transformation and tumour 
formation because of the inhibition of MCT1 and MCT4 function”. 
Notably, MCT4 is highly overexpressed in renal cancer and deple- 
tion of MCT4 causes accumulation of lactate, acidification and cell 
death in renal cancer cells”’. Renal cancer is associated with loss of 
the von Hippel-Lindau tumour suppressor protein (pVHL), result- 
ing in stabilization and activation of the a-subunits of HIF (HIFla 
and HIF2a). The dependency of renal cancer cells on MCT4 is likely 
to be caused by their high glycolytic activity owing to a pseudohy- 
poxic state. However, the complete picture of lactate transport in 
cancer may be more complex. Tumour-derived lactate can be taken 
up and oxidized by stromal cells”’, and there is evidence that cells 
within oxygenated areas of solid tumours may use lactate produced 
by hypoxic tumour cells”. This use increases glucose availability and 
supports the survival of cells within the hypoxic part of the tumour. 
Nevertheless, monocarboxylate transporters are attractive targets 
for cancer therapy, and inhibitors of MCT1, such as AZD-3965, are 
currently being tested in clinical trials. 


The role of lipid synthesis in cancer 
In addition to glucose and glutamine metabolism, the increased 
biosynthesis of macromolecules — particularly lipids — has been 
recognized as a component of the metabolic reprogramming in can- 
cer cells”. Although most cells in the adult body rely on lipids from 
the bloodstream, many cancer cells show a reactivation of de novo 
fatty-acid synthesis” (Fig. 1). Expression of enzymes involved in 
the synthesis of cholesterol and fatty acids is controlled by the sterol 
regulatory element binding proteins (SREBPs). These proteins are 
activated by AKT in an mTORC1-dependent manner, and SREBP 
target genes represent one of the main components of the transcrip- 
tional program downstream of mTORCI (refs 31, 32). Inhibition of 
SREBP function affects cell and organ size in fruitflies (Drosophila 
melanogaster) suggesting that lipid synthesis is essential for cell 
growth”. Furthermore, increased de novo fatty-acid synthesis, incor- 
poration of newly synthesized lipids into phosphoglycerides and 
enhanced expression of SREBP1 have been shown to correlate with 
breast cancer progression’. 

Although the exact role of lipid synthesis in cancer is not fully 
understood, it is likely that de novo lipogenesis contributes to the 
generation of structural lipids, such as sterols and phosphoglycerides 
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that are required for the generation of biological membranes. Tria- 
cylglycerides are stored in lipid droplets and can be used to generate 
energy, whereas some lipids act as second messengers and could 
contribute to signalling processes in cancer cells. Indeed, monoacyl- 
glycerol lipase, an enzyme that is overexpressed in aggressive can- 
cers in humans, induces a specific lipid signature that could trigger 
signalling events that are involved in cell migration and invasion™. 

Enhanced expression of enzymes within the cholesterol biosynthe- 
sis (mevalonate) pathway has been shown to induce breast epithelial 
cells to form three-dimensional structures that may represent early 
stages of cancer”. p53 was found to associate with SREBPs to drive 
the expression of these genes”*. The cholesterol biosynthesis pathway 
also provides intermediates for protein isoprenylation and loss of 
retinoblastoma protein results in enhanced prenylation of NRAS 
through activation of SREBP”**. 

The rate-limiting enzyme of this pathway, 3-hydroxy-3-methyl- 
glutaryl-CoA synthase (HMGCR), is the molecular target of statins 
— a group of widely used cholesterol-lowering drugs. HMGCR 
promotes cancer-cell proliferation and can cooperate with RAS to 
transform mouse embryo fibroblasts, suggesting that this pathway 
is crucial for cancer development”. A recent meta-analysis” failed 
to establish any beneficial effects of statin use on cancer incidence. 
However, altered study design, improved pharmacokinetics or the 
combination of statins with other chemotherapeutic agents may yet 
demonstrate their potential value for cancer therapy. 

Lipid synthesis may also have a non-cell-autonomous role in can- 
cer development. Adipocytes promote growth and metastasis forma- 
tion of ovarian cancer cells and provide them with lipids for energy 
generation”. It is also possible that lipogenesis in cancer cells could 
support the growth of cells located within nutrient-limited areas, 
thereby contributing to symbiotic relationships within tumours. 


Surviving the cancer environment 

The micro-environment of many solid tumours is characterized by 
limited oxygen availability as a result of the distance to the vas- 
culature. HIF drives metabolic adaptation to hypoxic conditions 
by inducing a distinct transcriptional program”. HIF induces the 
expression of glucose transporters and glycolytic enzymes, includ- 
ing glucose transporter 1 and 3 (GLUT1 and GLUT3), hexokinase 2 
and some isoforms of PFK2 (ref. 40). HIF also prevents the entry of 
pyruvate into the TCA cycle by inducing pyruvate dehydrogenase 
kinase 1 (PDHK1) (ref. 41, 42) and lowers cellular respiration by 
regulating cytochrome c oxidase isoform expression and inhibiting 
mitochondrial biogenesis”. 

Other metabolic constraints imposed by the in vivo tumour micro- 
environment are less well studied. Glucose starvation can select for 
oncogenic mutations in KRAS and thus promote cell transforma- 
tion”. In vivo distribution of other metabolites, including lipids and 
lipoproteins, could also significantly affect tumour-cell survival and 
cancer development. Metabolite analysis by magnetic resonance 
spectroscopy (MRS) and the use of labelled metabolites as tracers 
for positron emission tomography (PET) are required to establish a 
more complete picture of the metabolic profile of cancer cells within 
a solid tumour (Box 1). For example, stable isotope labelling, cou- 
pled with nuclear magnetic resonance analysis, has demonstrated 
that orthotopically implanted human glioblastoma cells use glucose 
— rather than glutamine — to produce TCA-cycle intermediates™. 
Analysis of cells in the context of intact tissues is clearly required to 
fully understand the effect of the tumour micro-environment on the 
metabolic activity of cancer cells. 


Oxidative stress and transformation 

Metabolic processes have an essential role in the regulation of cel- 
lular redox balance. The mitochondrial respiratory chain is the main 
source of free radicals, mainly through the production of superox- 
ide radicals by complex I and III. Although superoxide produced 
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BOX1 


Alterations in the metabolic activity of cancer cells can be determined 
using various techniques. Cancer ‘metabolomics’ usually describes 
the comprehensive analysis of the full metabolite composition of 
cancer cells or tumours, whereas cancer ‘metabonomics’ is often 
used to describe the response of a system to perturbation, such 
as drug treatment. Mass spectrometry allows the detection of a 
large number of cellular metabolites with high accuracy. It is often 
combined with initial separation of complex samples by gas or liquid 
chromatography, or capillary electrophoresis. To identify individual 
metabolites, mass-spectrometry spectra are compared with data 
from spectral libraries or reference compounds. However, it is often 
difficult to identify metabolites that produce overlapping spectral 
peaks. For accurate metabolite quantification, samples can be spiked 
with control compounds to determine the efficiency of detection. The 
technical advances of the past decade have meant the quantitative 
detection of hundreds or even thousands of metabolites is now 
routinely possible. 

Another technique that has received much interest is the 
determination of metabolic activity using stable isotope tracing. 
This technique provides a dynamic representation of the activity 
of different metabolic processes. Cells are labelled with modified 
compounds (usually nutrients, such as glucose or glutamine) in 
which one or several carbon atoms have been replaced with a heavy 
isotope®®. Metabolites that incorporate the heavy isotope can be 
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detected over time by mass spectrometry (through a characteristic 
shift in their mass) or nuclear magnetic resonance (NMR) 
spectroscopy. 

NMR spectroscopy can also be applied to detect metabolite 
levels in intact tissues or even in tumours in vivo. This generally 
detects natural abundant heavy isotopes (carbon-13, hydrogen-1, 
phosphorus-31), but can also be used to follow the metabolism 
of labelled synthetic metabolic substrates“*. One limitation is that 
substrates have to be applied at physiological concentrations, rather 
than as tracers. However, using hyperpolarized NMR techniques can 
increase sensitivity®. 

Another imaging strategy to determine the metabolic activity of 
tumours is positron emission tomography (PET). This technique 
detects the accumulation of tracers within the tumour tissue. 
Fluorodeoxyglucose PET imaging detects primary tumours and 
metastases owing to their high rate of glucose uptake. Tracers based 
on other metabolites, such as acetate, glutamine, thymidine or glycine, 
are in development. 

In parallel to analytical advances, mathematical tools for the analysis 
of metabolism are being developed. Global reconstructions of human 
metabolic networks are available’, and metabolic flux analysis can 
be used to produce quantitative models of metabolic activity. Together, 
these tools will generate a more comprehensive understanding of the 
metabolic reprogramming in cancer. 


within the mitochondrial matrix can be detoxified through the 
actions of superoxide dismutase and catalase, free radicals released 
into the intermembrane space can contribute to the generation of 
cytoplasmic reactive oxygen species (ROS). Mitochondrial ROS pro- 
duction is required for RAS-dependent cell transformation® and 
contributes to hypoxic stabilisation of HIF”. Inhibition of MYC- 
dependent expression of mitochondrial genes by FOXO3a modu- 
lates ROS metabolism and prevents HIF stabilisation in hypoxic 
cells*”. However, expression of physiological levels of oncogenic 
KRAS (KRAS°””) in mouse embryo fibroblasts actually lowers ROS 
levels by activating the nuclear factor (erythroid-derived 2)-like 2 
NREF2, which is involved in the regulation of enzymes that detox- 
ify free radicals”. 

ROS levels can directly affect the metabolic activity of cancer 
cells. Inhibition of PKM2 by ROS-dependent covalent modifica- 
tion of a cysteine residue supports the production of NADPH for 
anti-oxidant synthesis by allowing the accumulation of glycolytic 
intermediates”. However, excess ROS production leads to cell death 
and limits tumour growth. Cancer cells need an optimal level of ROS 
that supports cancer-promoting signalling functions, but does not 
lead to irreversible oxidative damage that results in cell death or 
senescence. Tipping this balance could offer routes into developing 
cancer therapeutics. 


Metabolic oncogenes and tumour suppressors 

In addition to the role metabolic alterations have in facilitating the 
growth-promoting response to oncogene activation, they can also 
actively drive the transformation process. This revised view of cancer 
metabolism emerged from the recognition that metabolic enzymes 
are themselves subject to genetic alterations in cancer. 


Mitochondrial tumour suppressors 

The discovery that inherited mutations in genes that encode suc- 
cinate dehydrogenase (SDHB, SDHC and SDHD) or fumarate 
hydratase (FH) are associated with familial cancer syndromes raised 


interest in the role of these genes as tumour suppressors (reviewed 
in ref. 50). The enzymes encoded by these genes are components of 
the TCA-cycle. 

Inhibition of succinate dehydrogenase leads to the inhibition of 
prolyl hydroxylases by the accumulation of the TCA cycle interme- 
diate succinate. Prolyl hydroxylases are responsible for the hydrox- 
ylation of HIF on two proline residues, which labels the protein 
for VHL-dependent ubiquitylation and subsequent degradation in 
tissues with a normal oxygen concentration. Prolyl hydroxylases 
use the oxidative decarboxylation of a-ketoglutarate to transfer a 
hydroxyl group onto their substrates. This reaction is inhibited in 
the presence of succinate or fumarate and can result in the accu- 
mulation of prolyl hydroxylase substrates, including HIF”. Loss of 
succinate dehydrogenase or fumarate hydratase is associated with 
certain forms of hereditary renal cancer, and accumulation of succi- 
nate as a result of feedback inhibition and stabilization of HIF could 
lead to cancer development. Recent evidence suggests that renal 
cyst formation after Fh1 deletion is independent of HIF, but involves 
activation of the NRF2 pathway by fumarate, and that activation of 
NRF2 may contribute to the development of fumarate-hydratase- 
deficient cancers*'. However, altered epigenetic regulation owing 
to the inhibition of other a-ketoglutarate-dependent dioxygenases 
(including histone and DNA demethylases, discussed in more detail 
later) may also contribute to tumorigenesis in these syndromes”. 


Oncogenic mutation of metabolic enzymes 

Somatic mutations in IDH1 and IDH2 have been found at high fre- 
quency in secondary glioblastoma®™ and in acute myeloid leukae- 
mia (AML)”’. These mutations always cause a single amino-acid 
change in one of the two alleles of either gene (arginine 132 inIDH1 
or arginine 172 in IDH2). Initially, mutation of IDH1 in glioma 
was thought to lead to dominant-negative inhibition of the wild- 
type protein and cause activation of HIF through the decreased 
availability of a-ketoglutarate*’. However, mutant IDH proteins 
have been subsequently shown to acquire a neomorphic enzymatic 
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Figure 2 | Epigenetic regulation by 2-hydroxyglutarate. Mutant forms 

of isocitrate dehydrogenase (IDH) 1 or 2 found in cancer produce the 
oncometabolite 2-hydroxyglutarate (2-HG) through a neomorphic 
enzymatic activity. 2-Hydroxyglutarate inhibits DNA demethylases, 
including TET2, inhibiting haematopoietic differentiation”. Inhibition of 
DNA demethylases by 2-deoxyglucose is also associated with increased 
genome-wide DNA methylation and a CpG island methylator phenotype 

in glioma®*"'. 2-Hydroxyglutarate inhibits histone demethylases, including 
lysine-specific demethylase 4C (KDM4C), lysine-specific demethylase 

7A (KDM7A) and lysine-specific demethylase 4A (KDM4A). This causes 
increased histone methylation in glioma cells” and leads to the inhibition of 
expression of lineage-specific differentiation genes in primary astrocytes™. 
The (R)-enantiomer of 2-hydroxyglutarate stimulates the activity of the 
prolylhydroxylase EGLN 1, which results in enhanced degradation of HIFla 
and promotes transformation in human astrocytes”. a-KG, a-ketoglutarate. 


activity and catalyse the conversion of a-ketoglutarate to 2-hydrox- 
yglutarate’’. Indeed, increased levels of 2-hydroxyglutarate have 
been found in IDH-mutant tumours”. This discovery suggested 
that small molecules could potentially be drivers of cell transfor- 
mation. The term oncometabolite was coined when it was shown 
that 2-hydroxyglutarate modulates gene expression in cancer cells 
by affecting epigenetic regulation (Fig. 2). Mutations of IDH1 and 
IDH2 were found to be associated with high levels of DNA methyla- 
tion in AML (ref. 60). Accumulation of 2-hydroxyglutarate inhibits 
the function of a-ketoglutarate-dependent dioxygenases, includ- 
ing tet methylcytosine dioxygenase 2 (TET2), a DNA demethylase 
involved in epigenetic regulation”. Interestingly, mutations of TET2 
in AML are restricted to patients that carry no IDH mutations”. 
2-Hydroxyglutarate can also affect histone modification by regu- 
lating the Jumonji-domain-containing protein 2A (JMJD2A, also 
known as KDM4A)”. IDH1 mutations have also been associated 
with a hypermethylator phenotype in glioma®, and expression of 
mutant IDH1 has been shown to lead to the repression of lineage- 
specific differentiation genes in primary astrocytes™ (Fig. 2). These 
findings suggest that accumulation of 2-hydroxyglutarate as a result 
of IDH mutation keeps cells in an undifferentiated, or stem-cell-like, 
state that may be more permissive for transformation. Moreover, a 
conditional knock-in of mutant IDH1™’™ caused an expansion of 
early haematopoietic progenitors and was associated with methyla- 
tion changes similar to those observed in human AML”. 
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2-Hydroxyglutarate produced by mutant IDH consists exclusively 
of the (R)-enantiomer (one of two possible stereoisomers). (R)- 
2-Hydroxyglutarate was found to stimulate the activity of the prolyl 
hydroxylase EGLN1 in human astrocytes (Fig. 2). This resulted in 
the degradation of HIF and the promotion of cell transformation”. 
Although this result was initially surprising, it was supported by the 
observation that IDH-mutant gliomas express lower levels of HIF 
target genes, suggesting that HIF has a tumour suppressive role in 
this subset of tumours”. 

Many cellular processes are highly dependent on the availability of 
metabolites, so metabolic reprogramming may have even more of a 
marked effect in cancer cells. Post-translational modification of pro- 
teins — particularly histones — through acetylation requires acetyl- 
CoA, and inhibition of acetyl-CoA production can affect histone 
acetylation in response to growth-factor stimulation”. One exam- 
ple of this effect is the sirtuins, a class of NAD*-dependent protein 
deacetylases, which are important regulators of energy metabolism 
and stress resistance. Changes in the availability of acetyl-CoA or 
NAD* caused by metabolic reprogramming in cancer may thus affect 
numerous cellular processes, including gene expression. 

The complex role of 2-hydroxyglutarate suggests that altered 
metabolic activity can have extensive effects on transcriptional and 
epigenetic regulation in cancer cells. The analysis of complete cancer 
metabolomes will probably uncover additional oncometabolites with 
similarly complex functions. 


Getting to the clinic 

Metabolic reprogramming in cancer is now widely recognized as 
important in offering opportunities for cancer treatment; however, 
strategies that target the enhanced glycolytic activity in cancer have, 
so far, not been very successful as treatments (reviewed in refs 68 
and 69). The early antimetabolite 2-deoxyglucose had only limited 
clinical success as a single-agent treatment, mainly because of a lack 
of efficacy. One mechanism of resistance to this compound is the 
induction of autophagy, a mechanism of self-degradation of cellular 
components. In addition, dose-limiting toxicities that affect brain 
and cardiac function have been observed, reflecting drug uptake by 
normal tissue in these key organs”. 

There is a clear need to explore other metabolic pathways, and 
potential synthetic lethality, that may be more successful. The 
increased metabolic activity of cancer cells, which is often associated 
with hypoxia, should render them selectively sensitive to perturba- 
tions within the metabolic network. 


Finding new targets 

Metabolic reprogramming may render cancer cells highly dependent 
on specific metabolic enzymes or processes that could be exploited 
for cancer therapy. However, the search for suitable targets may be 
complicated by the high plasticity of the metabolic network that can 
induce compensatory biosynthetic routes to generate the limiting 
metabolites, as well as the exchange of metabolites between cancer 
cells and the surrounding tissues. Several studies have used RNA 
interference screening tools to identify metabolic weaknesses in 
cancer cells. One study identified phosphoglycerate dehydrogenase, 
which catalyses the first committed step within the serine biosyn- 
thesis pathway, as essential for the in vivo growth of breast cancer 
cells’'. The gene encoding this enzyme, PHGDH, lies in a region of 
frequent copy-number gain in breast cancer and melanomas, sug- 
gesting that this enzyme has an important role in the development 
of these cancers. An independent study found that phosphoglycerate 
dehydrogenase is responsible for the enhanced diversion of carbon 
into the serine biosynthesis pathway in cancer cells”. Increased ser- 
ine biosynthesis could provide additional a-ketoglutarate for ana- 
plerosis in cancer cells or be involved in the generation of glycine 
for nucleotide biosynthesis and cysteine for the production of glu- 
tathione. Enhanced flux of glycolytic intermediates into the serine 
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biosynthesis pathway, owing to the reduced pyruvate kinase activity 
in PKM2-expressing cancer cells, can also be important to maintain 
mTORC1 activity under conditions of serine depletion”. A study 
using a screening approach identified an isoform of phosphofruc- 
tokinase 2, called PFKFB4, as an important enzyme for the survival 
of prostate cancer cells”. 

An additional challenge is to identify those metabolic processes 
that specifically support the survival of cancer cells that carry 
defined oncogenic drivers. For example, AMP-activated kinase 
family member 5 (ARKS5) was found to be essential for the viability 
of cells that expressed oncogenic levels of MYC (ref. 14). ARK5 was 
required to limit mTORC1-dependent protein synthesis and main- 
tain mitochondrial activity and glutamine metabolism. Another 
study used a chemical synthetic-lethal screen to show that inhibi- 
tion of GLUT! selectively kills VHL-deficient renal cancer cells”. 

Metabolic weaknesses in cancer can also be predicted using an 
in silico approach. This uses stoichiometric models of metabolic net- 
works coupled to metabolic flux balance analysis and constraint- 
based modelling to generate models of cancer metabolism and to 
predict which metabolic genes are essential for cancer-cell survival”. 
This method was used to identify metabolic pathways that are essen- 
tial to support the viability of FH1-mutant cancer cells (synthetic 
lethal)”. A highly parallel metabolomics approach established cel- 
lular consumption and release profiles of more than 200 metabolites 
across the NCI-60 panel (the National Cancer Institute’s collection 
of 60 human cancer cell lines)”’. This study reported that glycine 
consumption correlates with cellular proliferation rate, and sug- 
gests that targeting glycine metabolism could selectively compro- 
mise nucleotide biosynthesis in rapidly proliferating cancer cells. 
These approaches have the potential to differentiate between com- 
mon events that are essential for the metabolic reprogramming of 
most cancer types and specific metabolic alterations that apply only 
to cancers from a specific tissue or genetic background. Improved 
analytical capacity and high-throughput screening will continue to 
provide insight into the complexity of cancer metabolism. 


Patient selection 
Selecting the patients who are most likely to benefit from therapies 
that target cancer metabolism is an additional challenge. One pos- 
sibility is to stratify patients on the basis of genetic drivers. Metabolic 
sensitivities in cancer can depend on the activation state of specific 
oncogenes. The genetic complexity of cancer is, therefore, also likely 
to be reflected in the cells’ specific metabolic requirements. Under- 
standing this complexity is essential for identifying patients who are 
most likely to benefit from a specific treatment. For example, loss of 
p53 sensitizes cells to metformin, an inhibitor of mitochondrial ATP 
production and an activator of AMPK”. Metformin enhances the use 
of fatty acids for energy production and triggers autophagy”®. Both 
processes can be used to provide energy and promote cell survival 
when nutrients are scarce, but they rely on functional p53. 

Another oncogenic driver that is frequently activated in human 
cancer is KRAS. Expression of oncogenic KRAS was found to be 
essential for tumour maintenance in a genetic model of pancre- 
atic ductal adenocarcinoma”. Inhibition of KRAS expression was 
accompanied by specific metabolic alterations that demonstrated 
the role of KRAS in glycosylation by the induction of the hexosa- 
mine biosynthesis pathway and nucleotide biosynthesis through the 
non-oxidative arm of the PPP”’. Treatments that target these path- 
ways may only be effective when this oncogenic driver is present 
and could be offered to patients who carry this mutation. Other 
oncogenic lesions with a strong metabolic signal include activation 
of PI3K (also known as PIK3CA), AKT and MYC and loss of VHL. 
Alterations in these genes should be considered when assessing 
patient responses to targeted treatment. 

The metabolic profile of tumours depends not only on the type of 
genetic lesion but also on the tissue in which the mutation arises*. 
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This is further complicated by the increasingly recognized genetic 
intratumour heterogeneity of solid tumours”. However, because the 
metabolic state of cancer cells is strongly affected by the tumour 
micro-environment, biomarkers for tumour hypoxia could be used 
for patient selection. Several hypoxia gene signatures have been 
published, which include many of the genes that are involved in 
glycolytic metabolism. These have been useful in classifying patients 
who are likely to benefit from radiotherapy in combination with 
the hypoxic radiosensitizer nimorazole™. Similarly, tumours that 
are already hypoxic may be more sensitive to further deprivation of 
oxygen through anti-angiogenic therapy. Importantly, in vivo assess- 
ment of tumour hypoxia by fluoride-18-fluoroazomycin arabino- 
side ('"F-FAZA) or misonidazole scans could be vital to select those 
tumours that may be particularly sensitive to these therapies and to 
monitor treatment response. 


Targeting the tumour stroma 

The cells of a tumour are strongly influenced by its stromal com- 
ponent. Evidence suggests that there is an exchange of metabolites 
between cancer and stromal cells to provide nutrients*’. Stromal cells 
can have an important role in ROS metabolism within the tumour 
compartment™. This may be particularly important in maintaining 
the replicative potential of cancer stem cells. Although the metabolic 
requirements of cancer stem cells have not been investigated, early 
stem-cell development involves a metabolic switch to glycolysis that 
is reminiscent of the Warburg effect in cancer®’. Cancer stem cells 
require an environment low in ROS and may therefore be highly 
dependent on specific metabolic activities. Metabolic perturbations 
that selectively target cancer stem cells may be particularly effective 
for improving therapeutic response and preventing cancer recur- 
rence*!, However, suitable molecular markers are required to identify 
different types of stroma and to predict treatment response. 


Cancer diagnostics and dynamic monitoring of therapy 
Metabolic reprogramming in cancer has already been exploited for 
cancer diagnosis and to monitor treatment response (Box 1). Meta- 
bolic processes that are highly active in cancer cells can produce 
specific by-products that can be detected not only in tumour biop- 
sies but also in blood or urine samples*’. Enhanced glucose uptake 
forms the basis of tumour imaging by fluorodeoxyglucose- PET 
(FDG-PET) and can be used as an early indicator of drug efficacy™. 
However, not all tumours can be detected using FDG-PET imaging. 
The development of molecular tracers based on other metabolites, 
such as glutamine, acetate, thymidine or glycine, will offer the pos- 
sibility of profiling the metabolic state of individual tumours and of 
monitoring the alterations to their metabolic state during treatment. 
Nuclear magnetic resonance (NMR) spectroscopy offers a non-inva- 
sive method for the detection of selected metabolites in vivo and can 
offer insight into the body’s metabolic response to targeted therapy”. 
Importantly, changes in metabolic activity may occur quite rapidly in 
response to therapy, providing the opportunity to revise therapeutic 
strategies or add agents. In vivo imaging of carbon-13-labelled glu- 
cose by dynamic MRS has elucidated many aspects of human brain 
glutamine and glucose metabolism, and could be used to monitor 
rapid changes in response to therapy®*. These should be integrated 
with the use of drugs to target relevant metabolic pathways (for 
example, glutaminase inhibitors). 


Sparing normal tissue and avoiding toxicity 

Several compounds that specifically target cancer metabolism are 
already approved or under clinical trial, and many more are in 
preclinical development®”’. One of the greatest challenges that is 
faced during the development of therapeutic strategies to target 
cancer metabolism is the possibility of toxic effects on non-can- 
cerous tissues. Toxic side effects may be restricted to metabolic 
tissues, such as liver, but may also have marked effects on whole 
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Figure 3 | Exploiting metabolic reprogramming for cancer therapy. a, The 
increased biosynthetic activity of cancer cells renders them highly susceptible 
to inhibitors of anti-oxidant synthesis. Inhibition of NADPH production by 
the pentose phosphate pathway (PPP) or disruption of glutathione (GSH) 
synthesis results in increased reactive oxygen species (ROS). b, Reactivation of 
mitochondrial pyruvate metabolism by inhibition of pyruvate dehydrogenase 
kinase 1 (PDHK1) with dichloroacetate (DCA) in hypoxic cancer cells or after 
anti-angiogenic therapy (for example, bevacizumab) can lead to increased 
production of ROS and cell death. When pyruvate dehydrogenase (PDH) 

is inhibited, pyruvate is converted into lactate by lactate dehydrogenase A 
(LDHA). LDHA is also induced by hypoxia. c, Activators (such as TEPP- 

46) of the pyruvate kinase M2 (PKM2) increase glycolytic flux and reduce 
lactate production in cancer cells. This depletes glycolytic intermediates 
required for the serine biosynthesis pathway and reduces the availability 

of glucose-6-phosphate (G6P) for entry into the PPP. d, Metformin blocks 
mitochondrial ATP production, resulting in activation of AMP-activated 
protein kinase (AMPK) and increased glycolysis. Inhibition of purine 


body metabolism. For example, inhibitors of fatty-acid synthase 
decreased body weight by affecting the hormonal control of food 
intake in mice”. 

Whether metabolic reprogramming in cancer is intrinsically dif- 
ferent from the metabolic response to proliferative stimuli in non- 
transformed cells is not clear. Indeed, the proliferative response to 
receptor ligation in T cells involves increased nutrient uptake and 
enhanced glycolysis and is mediated by signalling pathways that are 
frequently activated in cancer, including those of PI(3)K, MYC and 
HIF (ref. 91). Moreover, some metabolic activities in cancer cells, 
such as increased glycine metabolism, are strongly correlated with 
cell proliferation”’. Inhibition of these processes could also affect 
other proliferative tissues. 

However, because metabolic reprogramming in cancer ren- 
ders cells highly dependent on ROS metabolism, disrupting the 
production of antioxidants could provide an effective treatment 
strategy. Considering the important role of glutathione in ROS 
detoxification, drugs that block glutathione synthesis, which were 
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metabolism by antimetabolites such as pemetrexed causes the accumulation 
of aminoimidazole carboxamide ribonucleotide (ZMP), a cell-intrinsic 
activator of AMPK. Combination of metformin or pemetrexed with the 
glycolysis inhibitor 2-deoxyglucose (2-DG) prevents the metabolic adaptation 
to AMPK activation. e, Activation of hypoxia-inducible factor (HIF) in 
response to anti-angiogenic drugs decreases mitochondrial oxidation of 
pyruvate through induction of PDHK1, increases the conversion of pyruvate 
to lactate by inducing expression of LDHA and regulates intracellular pH by 
inducing carbonic anhydrase 9 (CA9). Combining anti-angiogenic drugs 
with inhibitors of HIF or inhibitors of its downstream targets (LDHA and 
CAQ) blocks the adaptive response to hypoxia. f, Poly- ADP-ribose polymerase 
(PARP) is activated in response to DNA damaging agents and requires NAD* 
as a cofactor. Nicotinamide phosphoribosyltransferase (NAMPT) restores 
cellular NAD* pools through the salvage pathway and is required to maintain 
glycolytic activity in the presence of LDHA inhibitors. Inhibitors of NAMPT 
may also synergize with PARP inhibitors, particularly in the treatment of 
tumours carrying mutations in DNA-damage repair-pathways. 


originally developed to overcome drug resistance (for example, 
L-buthionine-(S,R)-sulfoximine) should be re-evaluated (Fig. 3a). 
Furthermore, chemotherapeutics that induce oxidative stress could 
be combined with strategies to block NADPH production to 
achieve synergistic effects. 


Reactivating a suppressed pathway 

Particularly relevant is the switch to glycolysis from oxidative phos- 
phorylation by hypoxic induction of PDHK1. PDHK1 is inhibited by 
dichloroacetate. In patients with glioblastoma, this drug was shown 
to reactivate mitochondrial function and generate free radicals, 
which were toxic to tumour growth”. A combination of this class of 
drug with drugs that are activated by hypoxia or those that induce 
hypoxia could result in synergistic effects (Fig. 3b). Of note was the 
anti-angiogenic effect after treatment with dichloroacetate”. Some 
of the changes associated with metabolic reprogramming in cancer 
are also recapitulated in the tumour endothelium”; this tissue may 
be targeted using similar therapeutic strategies. Another strategy 
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involving the reactivation of a suppressed pathway would be to reac- 
tivate PKM2 in cancer cells using small molecules (Fig. 3c). One of 
these activators, TEPP-46, decreased the growth of non-small-cell 
lung cancer cells in a xenograft model”. 


Blocking the escape route 

Metformin is being widely investigated for prevention and treatment 
of cancer. The drug’s main action is through the non-reversible inhi- 
bition of complex I of the mitochondrial respiratory chain, leading 
to a reduction in the ATP:AMP ratio and activation of AMPK. Ina 
preclinical study, a combination of metformin and 2-deoxyglucose 
was effective in a wide range of tumour types”. Antimetabolites that 
inhibit purine metabolism, such as pemetrexed, result in accumula- 
tion of aminoimidazole carboxamide ribonucleotide, an endogenous 
analogue of AMP and activator of AMPK”, and provide the option 
of combining the inhibition of glycolysis and purine metabolism for 
synergistic effects (Fig. 3d). 

Hypoxia is rapidly induced by anti-angiogenic therapy that targets 
vascular endothelial growth factor (VEGF) and by vascular targeting 
agents. Thus, further induction of many of these metabolic pathways 
will occur, potentially contributing to the survival of tumour cells 
and their resistance to therapy. Using induced hypoxia to synergize 
with other drugs, by targeting either HIF itself or the key down- 
stream survival pathways, such as CA9, has proven effective in pre- 
clinical models (Fig. 3e). The early assessment of degree of hypoxia 
induction using PET imaging or CA9 detection could help to classify 
the responses for personalized intervention. 


Synthetic lethality 

The most likely way to produce anticancer effects is synthetic 
lethality, and this follows well-established principles in antibiotic 
therapy or their combination with chemotherapy. A clear example 
of synthetic lethal effects is the combination of lactate dehydro- 
genase A (LDHA) inhibitor with a drug that blocks the synthesis 
of NAD* through the salvage pathway. NAD* can be synthesised 
de novo or recycled from nicotinamide through the salvage path- 
way, involving the enzyme nicotinamide phosphoribosyltransferase 
(NAMPT). Inhibition of LDHA results in the depletion of NAD* 
— which is crucial to maintain glycolytic flux — and inhibition of 
NAMPT enhances the effectiveness of LDHA inhibitors” (Fig. 3f). 
NAMPT inhibitors may also be useful in combination with classi- 
cal chemotherapeutic agents. Genotoxic damage caused by ionizing 
radiation or DNA-alkylating drugs leads to the activation of poly- 
(ADP-ribose) polymerase (PARP) and can rapidly deplete cellular 
pools of cofactor NAD*. Inhibitors of NAMPT block the restoration 
of NAD* pools and could increase the toxicity of DNA-damaging 
agents. Inhibition of NAMPT could also be combined with PARP 
inhibitors, which have been proven to be useful in the treatment 
of tumours that carry mutations in DNA-repair pathways, such as 
BRCA-mutant breast cancers. NAMPT inhibitors are currently being 
tested in phase II clinical trials. 


Future research 

Elucidating the complex interplay between oncogenic signalling path- 
ways and cellular metabolic activity is an exciting challenge for future 
research. Metabolic reprogramming of cancer cells can clearly not sim- 
ply be explained by a shift from oxidative phosphorylation to aerobic 
glycolysis. However, relatively little is known about the differences in 
metabolic dependencies of genetically diverse cancer cells or the com- 
plex metabolic interactions between tumour and stroma. Although 
the importance of ROS metabolism in cell transformation and tumour 
maintenance is becoming more evident, the relative contribution of 
different metabolic pathways to anti-oxidant production in cancer is 
not fully understood. Many metabolic pathways involved in the repro- 
gramming of cancer cells are closely linked to the metabolic changes 
associated with hypoxia. Future research should address how cancer 
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cells maintain the balance between enhanced biosynthetic activity and 
the need for antioxidant production. Disrupting this balance should 
selectively impair the viability of cancer cells and, together with appro- 
priate biomarkers and dynamic cancer imaging, provide new strategies 
for the treatment of cancer. m 
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Mitochondrial disorders as windows 
into an ancient organelle 


Scott B. Vafai>? & Vamsi K. Mootha!?" 


Much of our current knowledge about mitochondria has come from studying patients who have respiratory chain 
disorders. These disorders comprise a large collection of individually rare syndromes, each presenting in a unique and 
often devastating way. In recent years, there has been great progress in defining their genetic basis, but we still know 
little about the cascade of events that gives rise to such diverse pathology. Here, we review these disorders and explore 
them in the context of a contemporary understanding of mitochondrial evolution, biochemistry and genetics. Fully 
deciphering their pathogenesis is a challenging next step that will inspire the development of drug treatments for rare 


and common diseases. 


he field of mitochondrial medicine began in 1959 when 

Swedish endocrinologist Rolf Luft and his colleagues 

described the case of a young woman with euthyroid hyper- 
metabolism, which was characterized by profuse sweating and 
weight loss despite high calorie intake’. Muscle biopsy and enzyme 
analysis — which are now a linchpin for the diagnosis of these dis- 
orders — revealed an uncoupling of mitochondria in the patient. 
Although described decades ago, the condition has been reported 
only once more’ and the root cause of Luft’s disease remains 
a mystery. 

Since the initial case report, more than 150 distinct genetic mito- 
chondrial syndromes have been defined. The largest subset arises 
from lesions that influence the function of the respiratory chain, 
which affect at least 1 in 5,000 live births’ and are the focus of this 
Review. These diseases can present either in infancy or adulthood, 
and in a multisystemic or highly tissue-specific manner. Signature 
traits can include lactic acidosis, skeletal myopathy, deafness, blind- 
ness, subacute neurodegeneration, intestinal dysmotility and periph- 
eral neuropathy. Most organ systems can be affected or spared, in 
varying combinations (Fig. 1). 

The clinical features of mitochondrial disorders have been 
reviewed in detail*®, but select examples are illustrative. Leber’s 
hereditary optic neuropathy (LHON) — which is the result of 
point mutations in a ubiquitously expressed mitochondrial DNA 
(mtDNA)-encoded respiratory chain protein — demonstrates 
remarkable tissue specificity, with patients developing sudden vision 
loss as young adults. By contrast, point mutations in an mtDNA- 
encoded transfer RNA cause mitochondrial myopathy, encepha- 
lopathy, lactic acidosis and stroke-like episodes (MELAS), with a 
multisystem presentation that includes seizures, impaired hearing, 
stroke-like episodes and lactic acidosis. Mutations in the nuclear- 
encoded mitochondrial DNA polymerase gene POLG underscore 
pleiotropy: some mutations result in mild ocular-muscle weakness, 
whereas others produce Alpers’ syndrome (characterized by psy- 
chomotor regression, seizures and liver failure). These genetic dis- 
orders can be mimicked by drugs that have off-target toxicity. A 1995 
phase 2 clinical trial for fialuridine — a promising new antiviral for 
hepatitis B — was halted after the drug caused lactic acidosis, myo- 
pathy, neuropathy and even fatal liver failure in some individuals®. 


As our ability to define the genetic and environmental bases of 
mitochondrial disorders has accelerated, new questions have arisen. 
Why are some disorders highly tissue-specific, and others multisys- 
temic? How can loss-of-function mutations in bacteria-conserved 
proteins even be compatible with life? Why do some drugs that 
inhibit mitochondria cause disease, whereas other inhibitors are 
protective against it? What underlies sporadic cases of mitochondrial 
disease? The answers to most of these questions are a mystery, but 
if solved they could provide fundamental insight into metabolism, 
lead to new therapies for orphan diseases, and help us to rigorously 
evaluate the role of mitochondria in common diseases. 

In this Review, we outline four of the key properties of 
mitochondria — their evolutionary origins, metabolic interconnec- 
tions, heterogeneity and robustness — that help to explain some of the 
mysterious features of these disorders. We review recent insight into 
their genetic architecture, and emerging themes in their pathogenesis. 
Finally, we discuss the challenges and opportunities that lie ahead for 
this relatively young field of medicine. 


Origin and evolution of mitochondria 
Mitochondria have an endosymbiotic origin and retain many ves- 
tiges of their bacterial ancestry, including a double membrane anda 
circular genome (the mtDNA). They resemble microbes in that they 
are typically about one micrometre in scale and constantly move, 
divide and fuse to form a dynamic network. Although mitochon- 
dria are referred to as semi-autonomous organelles, billions of years 
of expansive and reductive evolution (Fig. 2a) — accompanied by 
transfer of most of their genes to the nuclear genome — have now 
effectively hard-wired these organelles within eukaryotic cells. 
Human mtDNA is maternally inherited and encodes only 13 pro- 
teins, as well as the 22 tRNA and 2 ribosomal RNA genes required 
for their translation. All other proteins required to maintain and 
express mtDNA are encoded by the nuclear genome. Great progress 
has been made in defining the mammalian mitochondrial proteome, 
with more than 1,100 proteins assigned to this compartment’. Inter- 
estingly, mtDNA has a monophyletic origin®, whereas the history 
of the mitochondrial proteome is far more complex””. Of the 1,100 
known mitochondrial proteins, about two-thirds have bacterial 
origins (probably from multiple phyla), with the rest representing 
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Figure 1 | Phenotypic spectrum of mitochondrial disorders. a, Common 
clinical manifestations of mitochondrial disorders. b, Clinical images 
depicting pathology from patients with a variety of mitochondrial disorders. 
Clockwise from top left, 3-Tesla fluid-attenuated inversion-recovery brain 
magnetic resonance imaging demonstrating Leigh syndrome lesions, which 
are characterized in this image by a hyperintense signal within the caudate and 
putamen bilaterally (arrows) seen on an axial cut through the basal ganglia; 
retinal image of the acute phase of Leber’s hereditary optic neuropathy, 
demonstrating an optic disc with swollen nerve fibre layer that is associated 
with engorged and obscured blood vessels (arrows); ragged red fibre (arrow) 


eukaryotic innovations’. The mosaic composition of human mito- 
chondria is evident in the organelle’s replication and translation 
machinery, with the ribosome closely resembling its bacterial 
counterpart'’ and the DNA polymerase resembling that of a viral 
(bacteriophage) ancestor’. As discussed later, the molecular basis of 
certain mitochondrial pathologies becomes clear when the ancestry 
of the organelle is taken into account. 

During the course of evolution, the organelle ceded ownership of cer- 
tain pathways to the rest of the cell, but retained and even acquired oth- 
ers. For example, ribonucleotide reductase — which is used for de novo 
synthesis of deoxyribonucleotides — is found only in the cytosol, and 
deficiency of this enzyme causes mtDNA depletion syndrome”. In other 
cases, the mitochondria have retained a duplicated copy of the cytosolic 
pathway, such as for tetrahydrofolate-dependent one-carbon metabo- 
lism’. These paralogous one-carbon pathways seem to have adopted 
a different functional importance, and may be particularly relevant in 
disease states such as cancer’*. Understanding the logic of compartmen- 
talization and paralogous pathways is an ongoing challenge. 


Respiratory chain and its connections 

At the heart of mitochondria is the respiratory chain, the core 
machinery for oxidative phosphorylation (Fig. 2b). Classically, the 
respiratory chain is defined as four macromolecular complexes 
that catalyse electron transfer from reducing equivalents, which are 
derived from intermediary metabolism, to molecular oxygen. Free 
energy is conserved by coupling electron transport to the formation 
of a proton gradient, or proton motive force (PME), by three of these 
complexes (I, III and IV), which is then dissipated by F,F,-ATPase 
(complex V) for ATP synthesis. These complexes are associated with 
the inner membrane and consist of about 90 protein components, 
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seen on a modified Gomori-trichrome-stained skeletal-muscle section; 
anterior four-chamber cross-section of a heart that shows signs of hypertrophic 
cardiomyopathy, including cardiomegaly and asymmetrical septal 
hypertrophy; plain abdominal radiograph, showing massive bowel distention 
(arrow) in the setting of chronic intestinal pseudo-obstruction without 
evidence of mechanical obstruction; and bone-marrow aspirate sample that 
has been stained for iron to demonstrate a ringed sideroblast (arrow) — 
characterized by a halo of iron-laden mitochondria around the nucleus of an 
erythrocyte precursor — from a patient with myopathy, lactic acidosis and 
sideroblastic anaemia syndrome. 


only 13 of which are mtDNA-encoded. Although typically depicted 
as a linear chain operating in isolation, the respiratory chain is truly 
a hub in the network of cellular metabolism that is characterized by 
convergence and divergence of pathways, supercomplex formation 
and reversibility. 

Almost all of the cell’s redox reactions ultimately feed into the 
respiratory chain (Fig. 2b). Complexes I and II mediate two-electron 
transfer from NADH and FADH,, respectively, to the mobile elec- 
tron carrier coenzyme Q, providing links to the tricarboxylic acid 
(TCA) cycle. Coenzyme Q can also receive electrons from de novo 
pyrimidine biosynthesis, fatty-acid and amino-acid oxidation, 
choline oxidation (ultimately affecting one-carbon metabolism), 
and glycolysis. Complex III, through its ‘Q-cycle; is an adaptor that 
receives two electrons from reduced coenzyme Q and funnels indi- 
vidual electrons to cytochrome c. Complex IV ends the respiratory 
chain by accepting electrons from cytochrome c and using them 
to fully reduce oxygen to water. Reactive oxygen species (ROS) are 
potentially toxic by-products of these reactions — especially at com- 
plexes I and HI — but are buffered by dedicated superoxide dis- 
mutase and catalase, as well as glutathione, thioredoxin and protein 
thiol systems. Interruptions to the respiratory chain can therefore 
affect nucleotide pools, TCA-cycle flux, one-carbon metabolism and 
ROS signalling to unleash numerous ripples (discussed later). 

The PMF is best known for driving ATP synthesis through oxi- 
dative phosphorylation, but it is linked to many other processes 
(Fig. 2b). The nicotinamide nucleotide transhydrogenase relies on 
the PMF to regenerate mitochondrial NADPH, which is required 
for ROS homeostasis. Furthermore, the PMF is coupled to solute 
and ion transport across the inner membrane, and collapse of the 
PME can halt essential biosynthetic reactions, such as Fe-S cluster 
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Figure 2 | Mitochondrial evolution and the respiratory chain. a, The 
modern human mitochondrial proteome consists of 13 proteins, which 
are encoded by mitochondrial DNA (mtDNA) and are a vestige of the 
original proteobacterial genome, as well as at least 1,100 additional 
proteins that are known to be encoded by the nuclear genome (nuDNA). 
Of these proteins, about 400 have a proteobacterial origin, determined 
by sequence similarity to the closest living relative of the ancestral 
proteobacterial species, Rickettsia prowazekii’*, with the remaining 
ancestral proteins lost during evolution. About 400 proteins were 
obtained from other bacterial organisms — estimated by determining the 
number of mitochondrial proteins with homologues in other prokaryotic 
organisms. About 300 proteins have no homologue in any prokaryotic 
organisms, and are eukaryotic innovations. b, On the left is the classic 
view of complexes I to V, with the number of mtDNA and nuclear-DNA- 
encoded subunits indicated. In the centre are selected biochemical 
pathways that are coupled to electron flow through the respiratory chain. 


biogenesis and protein import. The importance of the PMF is exem- 
plified by the fact that glycolytic ATP can be consumed by complex 
V that is run in reverse to defend PMF during states of respiratory 
chain inhibition. 

Although respiratory chain complexes are commonly depicted as 
freely moving through a random-collision model”, there is grow- 
ing evidence to support a solid-state model’®, in which individual 
complexes are physically grouped into supercomplexes. Native gel 
separations have demonstrated interactions between complexes I, III 
andIV’”"”’. In principle, such associations may promote stability and 
substrate channelling while minimizing ROS formation. The recent 
identification of genetic factors that are required for supercomplex 
assembly (reviewed in ref. 20) will allow a rigorous evaluation of 
their role in disease. 

Comparative analysis across species suggests that many path- 
ways connected to the respiratory chain are still undiscovered. For 
example, human complex I consists of about 45 subunits, includ- 
ing 14 core catalytic subunits that are found in bacterial complex I. 
Recently, the crystal structures of complex I have been elucidated 
for Escherichia coli” and Yarrowia lipolytica”. The putative mecha- 
nism raised by both structures indicates that the known functions, 
electron transfer and proton pumping, are accomplished by the 14 
ancient subunits. What functions do the remaining subunits serve? 
Although the leading hypothesis is that they are required for assem- 
bly, stability or regulation, it is tempting to speculate that complex I 
may be a scaffolding for additional enzymatic activities. For example, 
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Inner membrane 


Complex I and IJ transfer electrons from NADH and FADH,, respectively, 
to coenzyme Q, providing a link with the tricarboxylic acid (TCA) cycle. 
Coenzyme Q additionally receives reducing equivalents from glycolysis 
through mitochondrial glycerol-3-phosphate dehydrogenase (GPDH,,), 
de novo pyrimidine biosynthesis through dihydroorotate dehydrogenase 
(DHODH), choline oxidation and one-carbon metabolism through 
choline dehydrogenase (ChDH), and fatty acid and amino acid oxidation 
through electron-transferring flavoprotein dehydrogenase (ETF). On the 
right are selected processes that are coupled to the proton motive force 
(PMF), including ATP generation through complex V, calcium transport 
through the uniporter (U), NADPH generation through nicotinamide 
nucleotide transhydrogenase (NNT), ATP/ADP exchange through the 
adenine nucleotide translocator (ANT), protein import through the 
translocase of the inner mitochondrial membrane (TIM). Inorganic 
phosphate (Pi) transport is through its carrier (P). AY,,, mitochondrial 
membrane potential. 


m 


one subunit that is associated with complex I, NDUFAB1, has an 
acyl carrier protein domain that participates in type II fatty-acid 
synthesis” ; and the assembly factor ACAD9 shares sequence homol- 
ogy with very-long-chain fatty-acid dehydrogenase”. Moreover, one 
subset of complex I shares a phylogenomic signature with enzymes 
that are related to branched-chain amino-acid oxidation’. The other 
potential moonlighting roles that complex I may have require fur- 
ther investigation and could help us to understand the pleiotropic 
features that are observed in patients. 


Mitochondrial heterogeneity 

The extent to which mitochondria are specialized within each cel- 
lular context is underappreciated. Mitochondria from different 
organs exhibit distinct patterns of fuel use and biosynthetic capaci- 
ties. For example, skeletal-muscle mitochondria are adept at oxi- 
dizing fatty acids, and brain mitochondria are capable of oxidizing 
ketones, whereas adrenal mitochondria have a high capacity for ster- 
oid hormone biosynthesis. Electron microscopy studies have shown 
that mitochondrial content and morphology can be highly variable 
across tissues (Fig. 3a), owing to changes that occur in development 
and in response to environmental cues. Even within individual cells, 
mitochondria can exhibit heterogeneity: intermyofibrillar and sub- 
sarcolemmal mitochondria within skeletal muscle have distinct fuel 
preferences and respond differentially to physiological and patho- 
logical inputs”. In fact, one hypothesis postulates that mtDNA 
has persisted in evolution to endow individual mitochondria with 


© 2012 Macmillan Publishers Limited. All rights reserved 


the ability to tune their local energetic and redox state through local 
control of gene expression, the so-called co-location for redox 
regulation hypothesis”. 

Recent proteomic surveys of mitochondria from different organs 
have quantified the level of molecular heterogeneity, revealing that 
mitochondria from two distinct organs will typically share about 75% 
of their components’. This tissue heterogeneity has a wide range of 
functional consequences’”*. For example, whereas most respiratory 
chain complexes are invariant across organs, complex IV exhibits 
tissue-specific isoforms’ that were previously posited as crucial for 
responding to local oxygen tension”. The mitochondrial ribosome is 
used to translate the 13 mtDNA-encoded respiratory chain subunits, 
all of which are essential and found in all tissues. However, the protein 
composition of the mitochondrial ribosome exhibits striking tissue 
diversity’, the consequences of which are currently unknown. It is 
tempting to speculate that this variability contributes to the tissue- 
specific effect of certain mtDNA mutations. 

An exciting new frontier in mitochondrial biology is defining the 
regulatory mechanisms that give rise to the observed heterogene- 
ity, both within cells and across tissues (Fig. 3b). These mechanisms 
can be grouped into four broad categories, each of which has been 
reviewed elsewhere: organelle biogenesis*”*', movement”, fusion 
and fission*’, and mitophagy™. Co-location for redox regulation 
may in principle contribute to intracellular heterogeneity, although 
this hypothesis is in need of rigorous testing, and an understanding 
of the molecular mechanisms is still lacking. Mutations that affect 
some of these regulatory programs have already been identified as 
contributors to disease. For example, mutations in MFN2 and OPA1, 
both of which are required for mitochondrial fusion, cause Charcot- 
Marie-Tooth disease type 2A and autosomal dominant optic atrophy, 
respectively. PINK1 and parkin are required for mitophagy, and are 
mutated in Mendelian forms of Parkinson's disease. 
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Figure 3 | Mitochondrial tissue heterogeneity and robustness. a, Electron 
micrographs of mitochondria from various tissues, highlighting the diversity 
across tissues (scale bar, 200 nm). b, Processes that give rise to mitochondrial 
heterogeneity across tissues and within cells: movement, biogenesis, 
mitophagy, fusion-fission. Key regulators of these processes are listed. DRP1 
encoded by DNM1L; ERRa encoded by ESRRA; FIS1 encoded by FIS1; 
FUNDCI encoded by FUNDC1; GABPA encoded by GABPA; MFF encoded 
by MFF; MEN1 encoded by MFN1; MEN2 encoded by MFN2; MiD49 encoded 
by SMCR7; MiD51 encoded by SMCR7L; Milton, which represents the Milton 
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Robustness 


The mitochondrial respiratory chain is a robust system, which is capable 
of responding to fluctuating nutrient availability and demands. Cardiac 
mitochondria provide a striking example of robustness: they are capable 
of maintaining a constant ATP to ADP ratio over a fivefold dynamic 
range in workload in vivo during exercise. Two mechanisms underly- 
ing this regulation have been identified. First, Chance and Williams”, 
in their pioneering studies of isolated mitochondria, demonstrated that 
under many experimental conditions the rate of respiration and ATP 
synthesis is largely controlled by the availability of ADP — a feedback 
mechanism they termed respiratory control. The second mechanism 
posits that calcium operates in a feed-forward manner to ensure matched 
ATP use in the cytosol and its production in mitochondria*’. Many cyto- 
solic processes, such as neurotransmission and muscle contraction, are 
triggered by a rise in cytosolic calcium, and the same calcium signal can 
be transmitted into the matrix through the uniporter to stimulate the 
TCA cycle to ensure ATP production. 

Robustness probably extends across the entire respiratory chain and 
its many coupled reactions, as evidenced by classic studies of metabolic 
control analysis (MCA)**”’. MCA provides an experimental and theoreti- 
cal framework with which to understand the distribution of flux con- 
trol of a system property. For a system flux J, the control coefficient C’,, 
exerted by enzyme E, is defined as (AJ/J)/(AE,/E,), in which 2C),, = 1. Cy, 
denotes the percentage of control exerted by a single enzyme on the sys- 
tem. Experimentally, small molecules can be used to modulate E, while 
Jis measured. MCA has been used“ to show that control of respiration 
is broadly distributed and, moreover, that the distribution of control 
depends on workload. An important implication of such work is that the 
activity of a respiratory chain complex can be varied over a wide regime 
before overall respiration is affected (Fig. 3c). 

Robustness of mitochondrial metabolism has important implications 
for understanding disease”. First, the robustness may be the reason why 
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family of proteins encoded by TRAK1 and TRAK2 in humans; Miro, which 
represents the MIRO family of proteins, encoded by RHOTI and RHOT2 in 
humans; NIX encoded by BNIP3L; NRF1 encoded by NRF1; OPA1 encoded 
by OPA1; Parkin encoded by PARK2; PGC1la encoded by PPARGC1A; PGC1B 
encoded by PPARGC1B; and PINK] encoded by PINK1. c, Respiratory rate in 
isolated mitochondria from five types of rat tissue with complex IV inhibited 
to varying degrees with KCN. The rate is expressed as a percentage of the 
respiratory rate observed in untreated mitochondria from that tissue. Panel c 
adapted with permission from ref. 42. 
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even severe mutations can be tolerated within the oxidative phospho- 
rylation system and be compatible with life. Robustness of mitochon- 
dria implies that the system will be tolerant of hypomorphic alleles, 
which may be genetic contributors to disease when compounded with 
the appropriate stress, environmental modifier or a second genetic hit. 
The control coefficients of different respiratory chain complexes have 
been shown” to vary across rat tissues (Fig. 3c), leading to speculation 
that tissue-specific control coefficients have an influence when pathol- 
ogy becomes manifest. However, this attractive hypothesis still needs 
formal proof. 


Genetics of mitochondrial disorders 

Over the past 25 years, studies of individual patients and families 
who are affected by mitochondrial disorders have yielded a wealth of 
insight into their genetic architecture. These genetic lesions can lie 
in the mtDNA or nuclear DNA and have revealed the pathways that 
support respiratory chain assembly and activity (Fig. 4 and Table 1). 
These genetic studies have also helped to expand the phenotypic 
spectrum of mitochondrial disorders. 


Mutations in the mtDNA 

The sequencing of human mtDNA in 1981 (ref. 43) launched the 
molecular era of mitochondrial medicine. Two landmark papers 
in 1988 (refs 44 and 45) reported point mutations and deletions 
in mtDNA in LHON and mitochondrial myopathy, respectively. 
Since then, more than 300 point mutations, deletions and dupli- 
cations have been associated with a wide variety of symptoms 
(http://www.mitomap.org). mtDNA is oocyte-derived, so inher- 
ited forms of these disorders follow maternal inheritance. Although 
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Figure 4 | Genetic pathways underlying mitochondrial respiratory chain 
disorders. The genes that are known to be mutated in respiratory chain 
disorders can be grouped in five broad categories on the basis of the 
pathway in which they participate. The products of genes identified so far 
are listed in Table 1, and include those involved in disorders of individual 
oxidative phosphorylation subunits of complexes I-V; proteins involved in 
mtDNA replication, transcription or translation; proteins involved in the 
assembly of oxidative phosphorylation complexes, mitochondrial protein 
import and protein homeostasis (grouped under the pathway of oxidative 
phosphorylation biogenesis and regulation); the proteins that are involved 
in nucleotide transport and synthesis; and proteins that are involved in 
the control of membrane composition and dynamics. The pathways in this 
figure are adapted from refs 47 and 94. 
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some mtDNA disorders (such as LHON) are homoplasmic, others 
(such as MELAS) are heteroplasmic. Cells contain a mixture of 
wild-type and mutant mtDNA molecules in heteroplasmic disor- 
ders, and disease expression only occurs when the mutant mtDNA 
load exceeds a threshold. Stochastic segregation of mtDNA mol- 
ecules during development can therefore cause variable tissue 
expression of disease. 


Mutations in nuclear genes 

In 1995, the first underlying nuclear gene mutation of a mitochon- 
drial disorder was identified when the Munnich and R6tig groups 
reported mutations in SDHA in a patient with complex II deficiency 
and Leigh syndrome”. Sequencing of the human genome, combined 
with characterization of the mitochondrial proteome”, has propelled 
the discovery of nuclear disease genes and pathways that are required 
for respiratory chain assembly and function. Based on recent com- 
pilations*”** and disease databases (http://omim.org), there are now 
over 110 nuclear genes that are known to be mutated in respiratory 
chain disease, and this number is rapidly expanding. The genes that 
have been identified so far can be organized into five broad pathways 
involved in the expression, assembly and activity of the oxidative 
phosphorylation system (Fig. 4 and Table 1). 

Mutations in the nuclear-encoded tRNA synthetases underscore 
the phenotypic heterogeneity that can be associated with the same 
pathway. Although all of these gene products facilitate translation of 
13 mtDNA-encoded respiratory chain proteins, their clinical presen- 
tations are quite distinct**. For example, mutations in EARS2 present 
as leukoencephalopathy and high cerebrospinal fluid lactate, YARS2 
as myopathy and sideroblastic anaemia, HARS2 as ovarian failure, 
AARS2 as hypertrophic cardiomyopathy and SARS2 as pulmonary 
hypertension and renal failure. Moreover, these phenotypes do not 
match those arising from mutations in the corresponding tRNAs 
that are encoded by the mitochondrial genome. Mitochondrial het- 
erogeneity — perhaps at the level of the nuclear-encoded mitochon- 
drial ribosome’ — or moonlighting roles of tRNA synthetases” may 
underlie the tissue-specific pathology. 

Although most of the nuclear genes that underlie respiratory 
chain disease encode mitochondrial proteins, the small subset 
that do not provides valuable insight into the cross-talk between 
the organelle and the rest of the cell. For example, as already men- 
tioned, mitochondria are reliant on some cytosolic pathways for 
proper nucleotide homeostasis, and mutations in the genes that 
encode ribonucleotide reductase, RRM2B (ref. 12), and thymidine 
phosphorylase, TYMP (ref. 50), disrupt mtDNA maintenance. Muta- 
tions in WFS1, which encodes an endoplasmic-reticulum resident 
protein, cause Wolfram syndrome that is characterized by deafness, 
diabetes mellitus, diabetes insipidus and optic atrophy. Although 
not formally classified as a mitochondrial disorder, the phenotypic 
overlap and presence of mtDNA deletions in some patients has led 
to speculation that WFS1 mediates interactions between the endo- 
plasmic reticulum and mitochondria”’. This hypothesis is supported 
by the observation in Saccharomyces cerevisiae that a component 
of the endoplasmic-reticulum-mitochondria encounter struc- 
ture’, MMM1, co-localizes with mtDNA nucleoids and has a role 
in mtDNA stability”’. We anticipate that research into the genetics 
of mitochondrial disorders will continue to reveal unexpected con- 
nections, either physical or functional, between mitochondria and 
the rest of the cell. 


Environmental modifiers 

Environmental factors can influence the course of genetic mitochon- 
drial disorders, and even phenocopy them. For example, tobacco 
use and heavy consumption of alcohol are risk factors for loss of 
vision in LHON™. Exposure to toxic substances can produce pathol- 
ogy that resembles LHON, as was seen in an epidemic of blindness 
that occurred in Cuba in the early 1990s. The cause was ultimately 
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Table 1 | Products of genes that are known to be mutated in respiratory chain disorders grouped by pathway 


Oxidative phosphorylation subunits mtDNA maintenance and expression Oxidative phosphorylation biogenesis and Nucleotide Membrane 
regulation transport and dynamics and 
synthesis composition 
Nuclear encoded 
Complex I: NDUFA1, NDUFA2, NDUFAQ, TWINKLE, MTFMT, GFM1, LRPPRC, Complex I: NDUFAF1, NDUFAF2, DGUOK, ADCK3, AGK, 
NDUFA10, NDUFA11, NDUFA12, NDUFB3, MPV17, MRPS16, MRPS22, POLG, NDUFAF3, NDUFAF4, NDUFAF5, RRM2B, COQ2, COQ6, 
NDUFB9, NDUFS1, NDUFS2, NDUFS3, POLG2, TRMU, TSFM, TUFM, NDUFAF6, ACAD9Y, FOXRED1, SLC25A3, COQ9, DRP1, 
NDUFS4, NDUFS6, NDUFS7, NDUFS8, NDUFV1, C12orf65, MTPAP, MRPL3, SARS2, NUBPL ANT1, SUCLA2, MFN2, OPA1, 
NDUFV2 YARS2, HARS2, MARS2, AARS2, Complex Il: SDHAF1, SDHAF2 SUCLG1,TK2, PDSS1, 
Complex II: SDHA, SDHB, SDHC, SDHD RARS2, EARS2, DARS2, TACO1, Complex III: BCS1L, HCCS, TTC19  TYMP PDSS2, TAZ, 
Complex III: UQCRB, UQCRQ MTO1, RMND1, PNPT1, PUS1 Complex IV: COX10, COX15, SERAC1 


Complex IV: COX412, COX6B1 
Complex V: ATP5E 


ETHE1, FASTKD2, SCO1, SCO2, 
SURF1, COX14, COA5 


mtDNA encoded 
Complex I: ND1, ND2, ND3, ND4, ND4L, ND5, 12S rRNA, tRNATyr, tRNATrp, 
ND6 tRNAVal, tRNAThr, tRNASer1, 


Complex III: CYTB 
Complex IV: COX1, COX2 
Complex V: ATP6, ATP8 


tRNASer2, tRNAArg, tRNAGIn, 
tRNAPro, tRNAAsn, tRNAMet, 
tRNALeu1, tRNALeu2, tRNALys, 


Complex V: ATPAF2, TMEM70 
Fe-S: ABCB7, FXN, ISCU, NFU1, 
BOLA3, GLRX5 

Other: DNAJC19, GFER, HSPD1, 
SPG7, TIMM8A, AIFM1, AFG3L2 


tRNAlle, tRNAHis, tRNAGly, tRNAPhe, 
tRNAGIlu, tRNAAsp, tRNACys, tRNAAla 


List of gene products was generated through synthesis of existing compilations of genes known to be mutated in respiratory-chain disease’ 


found to be widespread folate deficiency combined with methanol 
toxicity from homemade rum”. Formate, a by-product of metha- 
nol metabolism, accumulates in the setting of folate deficiency and 
causes inhibition of complex IV. 

Certain medications have long been known to have toxic effects on 
mitochondrial function. Owing to the bacterial origins of the mito- 
chondrial ribosome, mtDNA translation can be adversely affected by 
antibiotics such as aminoglycosides, which can cause sensorineural 
deafness when administered at high doses. Individuals with certain 
mtDNA mutations in the 12S ribosomal rRNA — estimated to have a 
population prevalence of 0.19% — are predisposed to deafness from 
aminoglycosides, and can experience hearing loss even with short 
exposure to the recommended doses*””. The viral origins of the 
mitochondrial DNA polymerase make it susceptible to nucleoside 
analogue antivirals (including fialuridine). This class of drugs is used 
commonly for HIV, and can cause side effects such as lactic acidosis 
through mtDNA depletion™. 

Microorganisms are an emerging class of environmental modifi- 
ers, ranging from gut microbiota to viruses. Gut bacteria are drivers 
of disease progression in ethylmalonic encephalopathy. The causal 
gene, ETHE1, encodes an enzyme that detoxifies sulphur compounds, 
which are released by gut microbiota, and its loss results in H,S 
accumulation, with subsequent inhibition of complex IV and short- 
chain acyl-CoA dehydrogenase. Treatment with metronidazole (an 
antibiotic that reduces gut microbial content) and N-acetylcysteine 
(which promotes glutathione-mediated detoxification of H,S) results 
in clinical improvement”. Although a role for viral infections in the 
course of mitochondrial disorders has not been identified, experi- 
mental studies indicate that some viruses, such as HIV”, can modu- 
late complex I activity. 


Unsolved cases of mitochondrial disease 

Over the past 25 years, most of the mitochondrial disease genes have 
been identified in familial forms of disease, in which it is possible 
to follow the segregation of highly penetrant causal alleles. In our 
experience at Massachusetts General Hospital, we have found that 
less than 25% of patients with clinical and biochemical evidence of 
mitochondrial disease have strong evidence of an affected first- or 
second-degree relative. Several recent studies have applied exome 
sequencing to establish molecular diagnoses in singleton cases°*™. 
However, the success rate is lower than anticipated. Our experi- 
ence has shown that in biochemically proven, severe cases in infants 
exome sequencing should achieve a diagnosis in about half of cases”; 


4748 as well as review of the literature. 


however, the success rate is projected to be much lower in milder 
cases of disease, or those with adult onset. 

How can we explain these unsolved cases? Most exome studies of 
singleton cases have been powered to identify recessive mutations in 
mitochondrial proteins. It is possible that these unsolved cases are a 
result of dominant-acting, subtle recessive or regulatory mutations 
with incomplete penetrance, all of which are difficult to identify 
over the background of polymorphisms. Alternatively, these cases 
could be due to mutations in regions of DNA that are not targeted 
for sequencing. Some cases could also have purely environmental 
causes, as with the Cuban blindness epidemic. A tantalizing pos- 
sibility is that a subset of these unsolved cases is due to complex 
genetic inheritance, as a result of interactions between gene vari- 
ants that each have weak or synergistic effects, known as synergistic 
heterozygosity®. Targeted exome sequencing studies have reported 
that healthy controls will typically carry a burden of about 15-20 
heterozygous, loss-of-function protein alleles within their mito- 
chondrial proteomes”. This high burden of deleterious alleles is 
probably tolerated because of the robustness of mitochondrial net- 
works. However, it is possible that mutations that affect multiple 
genes, operating in the same or parallel pathways, may conspire 
to yield pathology. Notably, there is suggestive evidence that sup- 
ports synergistic interactions between mtDNA and nuclear DNA 
variants”. Defining the genetic architecture of the large number of 
unsolved sporadic cases of mitochondrial disease represents the next 
major challenge of mitochondrial genetics. 


Mitochondrial ripples and responses 

Despite remarkable progress in defining the genes and environmen- 
tal triggers that underlie mitochondrial disease, their pathogenesis 
remains almost a complete enigma. Many medical textbooks offer 
the oversimplified explanation that disease manifests in tissues with 
the highest ATP demand, or because of oxidative damage. Although 
these factors probably contribute to disease, the picture is much 
more complicated. Cellular models of disease indicate that there is 
a remarkable capacity for preservation of ATP production through 
enhanced glycolysis”, and, in animal models of respiratory chain 
dysfunction, pathology can develop without a major increase in oxi- 
dative damage”, Furthermore, the fact that inhibition of respira- 
tion can be tolerated, and even beneficial in certain settings (Box 1), 
indicates that the consequences of respiratory chain lesions are not 
uniformly bad, and suggests the involvement of nonlinear modes of 
pathogenesis and threshold effects. 
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BOX1 


Although respiratory chain inhibition is often viewed as 
undesirable, several observations have suggested that the 
consequences can be beneficial in certain situations. Metformin, 
the most commonly prescribed oral medication for type 2 
diabetes mellitus, inhibits complex I. This bioenergetic effect 

is thought to contribute to metformin’s inhibition of hepatic 
gluconeogenesis®®. Lactic acidosis is a known — although 
infrequent — side effect of metformin treatment, highlighting the 
fine balance between therapeutic and toxic effects of respiratory 
chain inhibition. Pretreatment with small-molecule inhibitors of 
the respiratory chain can protect organs, such as the brain and 


Can inhibition of the respiratory chain be beneficial? 


heart, against ischaemia-reperfusion injury®®. Finally, data from 
studies in both the roundworm Caenorhabditis elegans®’ and the 
fruitfly Drosophila melanogaster®® indicate that RNA-interference- 
mediated knockdown of several components of the respiratory 
chain extends lifespan. The general theme that emerges from 

all of these observations is that the cellular and organismal 
consequences of modest respiratory chain inhibition can be used 
for therapeutic purposes — a concept that some have termed 
mitochondrial hormesis. We suspect that therapeutic effects 
occur when the balance of responses to a respiratory chain lesion 
weighs in favour of homeostasis as opposed to pathogenesis. 


To systematically understand pathogenesis, it is convenient to 
consider the proximal consequences of respiratory chain lesions — 
or mitochondrial ripples — and the secondary cellular responses 
they evoke. Model-organism studies provide several examples 
of such ripple-response cascades. The ‘retrograde’ response in 
S. cerevisiae is a defined transcriptional response that allows the 
survival of that organism in the setting of an impaired respiratory 
chain, mainly by compensating for incomplete TCA-cycle function 
to maintain production of metabolites such as glutamate™. The 
precise ripples that trigger the yeast retrograde response remain 
unclear, although loss of the PMF is thought to have a key role”. 
In the fruitfly Drosophila melanogaster, respiratory chain impair- 
ment can trigger blockade of the G1-S transition of the cell cycle 
through two possible ripple-response pathways: mutation of a 
complex-I subunit leads to ROS-mediated activation of JNK sig- 
nalling”’, whereas diminished ATP production as a consequence of 
a complex-IV mutation causes AMPK activation”'. Studies in the 
roundworm Caenorhabditis elegans have defined a mitochondrial 
unfolded protein response (UPR™) that is activated by disturbed 
protein homeostasis resulting from insults to the respiratory chain”. 
The UPR™ comprises increased expression of mitochondrial chaper- 
ones, and, notably, mutations in this pathway can give rise to several 
neurodegenerative disorders’’. The lessons that have emerged from 
model-organism studies are that a broad range of ripple-response 
pairs can result from respiratory chain lesions, and that these pairs 
confer remarkable tolerance to such insults. However, some of these 
may be adaptive on short timescales and pathology-inducing over 
longer timescales. 

Metabolomics studies have attempted to systemically catalogue 
biochemical ripples that emanate from respiratory chain inhibition. 
One study characterized the effect of small-molecule inhibitors of 
the respiratory chain on metabolite flux in cultured cells”* and cor- 
related these changes to patient plasma measurements. Specific 
blockade of complex III resulted in decreased production of uri- 
dine, which was consistent with the long-standing observation that 
respiratory chain-deficient cells are uridine auxotrophs”*. Lactate 
secretion highlighted that the induction of the lactate dehydroge- 
nase reaction is a means to support glycolytic ATP production and 
consume NADH to maintain redox cofactor balance. Another study 
reported that reverse flux through the TCA cycle, specifically reduc- 
tive carboxylation of a-ketoglutarate to isocitrate, is a response to 
respiratory chain inhibition”. 

Ideally, investigation of ripple-response cascades should explain 
the end pathology. One promising cascade involves calcium. Mito- 
chondria have a major role in shaping the cytosolic calcium concen- 
trations through uptake with a uniporter’’”, which is dependent 
on an intact PMF. Several studies have converged on the mecha- 
nism that an increase in cytosolic calcium, secondary to loss of 
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mitochondrial PME, is a key signalling intermediate in response 
to respiratory chain lesions through activation of calcium-sensitive 
signalling factors (such as calcineurin and calcium/calmodulin- 
dependent protein kinase IV (CaMKIV)***'). We hypothesize that 
activation of calcium-dependent signalling may in fact help to 
explain several pathological hallmarks (Fig. 1). For example, activa- 
tion of CaMKIV can induce mitochondrial biogenesis”, potentially 
contributing to the finding of ‘ragged red fibres’ that are often seen 
on skeletal-muscle biopsy. Activation of calcineurin has also been 
found to induce hypertrophic cardiomyopathy”. Altered calcium 
dynamics in gastrointestinal interstitial cells of Cajal compromise 
their pacemaking activity™, and this may contribute to intestinal 
pseudo-obstruction (Fig. 1b). However, not all respiratory chain 
mutations disrupt calcium homeostasis*’. Moreover, mitochondrial 
calcium buffering can vary across tissues*’, and this may shape the 
pattern of tissue expression. 

Ripple-response cascades can flow outside of cells, leading to 
non-cell-autonomous effects. Lactic acidosis is the best known 
example, and can impair the function of multiple organs by reduc- 
ing serum pH. Fibroblast growth factor 21, a hormone that medi- 
ates aspects of the starvation response, is released from the skeletal 
muscle of patients with mitochondrial myopathy”. This has led 
to the intriguing hypothesis that it may drive systemic metabolic 
pathology. Recent evidence has pointed towards a crucial role for 
mitochondria in the regulation of innate immunity™; however, its 
role in mitochondrial disorders is largely unexplored. 


Future prospects and challenges 
Understanding the pathogenesis of mitochondrial disorders is an 
exciting new frontier, with many opportunities. First, as a group, 
these disorders affect at least 1 in 5,000 individuals, and there are 
no proven therapies®’. There is, therefore, a special opportunity to 
develop therapeutics for patients who otherwise have distressingly 
few options. Second, these disorders are a continuous source of 
insight into basic cell biology, and much of what we know about res- 
piratory chain assembly and compartmentalization of metabolism 
has come directly from studying them. Finally, there is great inter- 
est in the role of mitochondria in many common human diseases, 
which is fuelled by the observation that many common, degenerative 
disorders are associated with a quantitative decline in mitochondrial 
activity. Distinguishing cause from correlation, however, is challeng- 
ing and mitochondrial disorders may be valuable tools for clarify- 
ing the organelle’s role in common disease. Capitalizing on these 
opportunities will require that we fully decipher the pathogenesis 
of these orphan diseases. 

We anticipate that two complementary lines of investigation, cell- 
and patient-based, will be required to meet this challenge. First, 
it will be essential to characterize the network-level properties 


© 2012 Macmillan Publishers Limited. All rights reserved 


of mitochondria in cell-based studies. As a result of classic bio- 
chemical studies and (more recent) proteomic studies of isolated 
mitochondria, we have a reasonable knowledge of the organelle’s 
protein inventory and reaction repertoire, as well as a theoretical 
framework for understanding regulatory control. Moving forward, 
it will be crucial to experimentally map interactions among all of 
its components, both at a physical and genetic level, across differ- 
ent cell types. Initial progress has already been made in this area, 
with the recent production of a genetic interaction map, focusing 
on S. cerevisiae mitochondria”. The organelle’s rich evolutionary 
history and transcriptional regulation will facilitate computational 
identification of modules of functionally interacting genes”'. These 
network-level maps will reveal the homeostatic mechanisms that 
buffer against environmental or genetic insults. Second, in parallel, 
it will be crucial to catalogue genotypes and phenotypes from indi- 
vidual patients throughout the world. Next-generation sequencing 
is already facilitating sequencing of patient genomes, while Internet- 
and wireless-enabled technologies will yield in-depth phenotypes 
with unprecedented temporal resolution. Because the individual 
mitochondrial disorders are so rare and diverse, we anticipate that 
an open-source, collaborative model — in which patients and their 
doctors are partners for biomedical research — will be required so 
that data can be aggregated and shared for discovery. Metabolic pro- 
files, which are obtained from perturbed cells grown in culture as 
well as from human plasma”, may represent the key ingredient for 
connecting cell-level and patient-level data into predictive models 
of pathogenesis. 

Identifying a link between genotype and phenotype is a chal- 
lenge for all diseases. But features unique to mitochondria — their 
relatively well-studied biochemistry, a near-complete protein parts 
list, an ability to study mitochondria in isolation or in situ, the 
large number of monogenic disorders, and an active and engaged 
patient community — provide key advantages that promise to place 
this organelle at the forefront of the burgeoning field of medical 
systems biology. 

Studies of orphan mitochondrial disorders will be pivotal in solv- 
ing one of the most important problems in fundamental metabolism: 
the logic of compartmentalization. Why have certain pathways per- 
sisted within mitochondria, whereas others have been duplicated 
or fully relocated to the cytosol? Mitochondria have retained a 
bacterial type-II fatty-acid synthesis pathway, as well as duplicate 
versions of tetrahydrofolate-dependent one-carbon metabolism and 
the gluconeogenic enzyme phosphoenolpyruvate carboxykinase. 
What advantage is conferred by the gain or loss of such pathways? 
We anticipate that studies of the rare mitochondrial disorders will 
shed light on the in vivo relevance of these pathways, and the cir- 
cumstances in which they are operative. New ways of measuring 
compartment-specific metabolism, both in cultured cells and in vivo, 
will be required to drive this field forward. 

We can be optimistic that understanding mitochondrial pathogen- 
esis will enable the development of new therapeutics for these devastat- 
ing disorders, some of which may also be useful for the treatment of 
common diseases. An example of such repurposing can be seen in the 
history of dichloroacetate, a small molecule that reduces lactic acid pro- 
duction by stimulating pyruvate dehydrogenase. Dichloroacetate was 
tested for the treatment of lactic acidosis in mitochondrial disorders 
but lacked clinical efficacy and resulted in toxic side effects”. Although 
it was largely unsuccessful for use in the treatment of mitochondrial 
disorders, it has since been repurposed and is now being tested in clini- 
cal trials for some cancers. Many cancers have an increased reliance on 
aerobic glycolysis with concomitant lactate production — the so-called 
Warburg effect. Dichloroacetate is being used to reverse this metabolic 
hallmark, with promising early results in human trials”. We anticipate 
that this pattern will continue, and that drug development for mito- 
chondrial disorders will ultimately prove beneficial for patients with 
these rare disorders, as well as those with more common disease. = 
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Metabolic phenotyping in clinical 
and surgical environments 


Jeremy K. Nicholson’, Elaine Holmes’, James M. Kinross’, Ara W. Darzi‘, Zoltan Takats' & John C. Lindon! 


Metabolic phenotyping involves the comprehensive analysis of biological fluids or tissue samples. This analysis allows 
biochemical classification of a person’s physiological or pathological states that relate to disease diagnosis or prognosis at 
the individual level and to disease risk factors at the population level. These approaches are currently being implemented 
in hospital environments and in regional phenotyping centres worldwide. The ultimate aim of such work is to gener- 
ate information on patient biology using techniques such as patient stratification to better inform clinicians on factors 
that will enhance diagnosis or the choice of therapy. There have been many reports of direct applications of metabolic 


phenotyping in a clinical setting. 


ingly dependent on the use of molecular tools that help to 

classify diseases and their subtypes, and to define underlying 
individual variations in patient biology. The application of stratified 
and new therapeutic approaches that have been optimized through 
predictive modelling of deep biological information (for example, 
genetic, metabolic or physiological) on individual patient variation 
will not only have major health-care benefits, but also inevitably lead 
to socioeconomic, health-care deployment, regulatory and research 
changes in the clinic’. One of the most widely applicable areas for the 
development of precision medicine relates to the diverse applications of 
metabolic phenotyping — or metabotyping” — to clinical diagnostics, 
prognostics and molecular epidemiology. The metabotypes of indi- 
viduals can be measured from the composition of accessible biofluids 
or tissues that are sampled in the clinic. Metabotypes vary extensively 
between individuals and populations, and result from the complex 
interplay between host genes, lifestyle, diet and gut microbes”. Thus, 
metabotyping has applications in both population-based disease-risk 
investigation studies and in solving problems related to personalized 
health care and patient stratification’. Hence, the ability to generate 
metabolic phenotypes from large sample cohorts that have been col- 
lected as part of epidemiological studies means that the ensuing enor- 
mous statistical power allows the identification of good candidates for 
metabolic biomarkers of disease risk in different populations (such as 
predictors of elevated blood pressure in so-called metabolome-wide 
association studies’). The gene-environment interactions that deter- 
mine metabotypes are identical to those that determine disease risk in 
the general population, as well as individual susceptibility to disease 
and response to treatment. Thus, metabotypes are both statistically and 
biologically connected to disease risk factors and treatment outcomes, 
and thereby underpin the value of metabolic analysis in a diverse range 
of medical scenarios*. Metabolic phenotypes have been measured and 
mapped indirectly for many centuries — mainly unknowingly. For 
example, the urine wheel was used by physicians to relate the colours, 
smells and tastes of urine samples to likely diagnoses and treatments’. 
More recently, spectroscopic methods have been applied to generate 
multivariate profiles of metabolites, mainly using nuclear magnetic 
resonance (NMR) or mass-spectrometric methods that can measure 
a wide range of metabolites simultaneously. The data are then analysed 
using multivariate statistics (Fig. 1 and Box 1). 


( linical diagnosis, prognosis and treatment selection are increas- 


A number of terms are used to describe the various metabolic-anal- 
ysis procedures. Metabolomics’, for example, essentially describes the 
metabolic composition of a given sample in terms of metabolite pres- 
ence and concentration, the metabolome being the multivariate sum of 
these components. There are about 500 histologically distinct cell types in 
the human body. Each one of these cell types has specific functions and, 
consequently, a different gene expression pattern, proteome and metabo- 
lome. Cellular metabotypes may overlap within histological specimens, 
but they interact in space and time through the connecting vascular and 
lymphatic systems. Humans, therefore, contain more than 500 dynamic 
cellular metabolomes, as well as those of the individual tissue-specific 
extracellular fluid compartments (which are compositionally different 
from their surrounding cells) and the various secretory and excretory 
biological fluids (Fig. 2). Disease processes and medical treatments also 
occur over variable time frames, and metabotypes change dynamically 
with disease and treatment. Thus, the term metabonomics has been used 
since the late 1990s (ref. 7) to describe the metabolic responses of com- 
plex systems to perturbations through time, and how these responses can 
be mapped using appropriate analytical and statistical techniques. This 
stimulus could be disease, nutritional changes, drug therapy, genetic mod- 
ulation or a myriad of other inputs. Specifically, metabonomics addresses 
such phenotypic changes at the level of small-molecule metabolites, and 
usually in the context of analysis of body fluids such as urine or blood 
plasma. The terms metabolomics and metabonomics are widely — and 
often interchangeably — used, having received about 26,000 and about 
10,400 Google Scholar hits, respectively, at the time of writing. Although 
metabolic profiling has been applied in a wide variety of research fields 
over the past 30 years — ranging from microbiology to plant and food 
science, through to animal toxicology and mechanisms of disease — it is 
the clinical areas that are currently receiving the most attention. Apply- 
ing high-throughput metabolic technologies to provide new diagnostic 
biomarkers and to uncover disease mechanisms is an attractive proposi- 
tion. In this Review, we discuss some of the key and developing areas in 
clinical metabotyping, and its range of applications in progressing our 
understanding of human disease processes. 


Metabolic analysis of biofluids, cells and tissue 

A series of interacting metabolic networks that operate in multiple body 
compartments gives rise to a continuum of metabolic processes that con- 
tribute to the overall metabotype, and includes contributions from diet, 
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Figure 1 | Technology platforms and analytical timescales for patient 
journey phenotyping, diagnostic and prognostic biomarker discovery, 
and population disease-risk biomarker modelling. Different analytical 
technologies (Box 1) can be applied to a variety of clinically derived 
biosamples, and the choice of technology is dependent on the timescale 

for reaching a solution to the clinical problem, as well as the analytical 
performance characteristics of the technology. Thus, surgical problems 
require either real-time or near real-time solutions for clinical decision- 
making, whereas histopathological augmentation has a multi-hour timescale. 
The cohort size for predictive modelling for technique optimization depends 
on the biological information obtained. Epidemiological problems, such as 


drugs and gut-microbial activities*. The local phenotypic expression of the 
network interactions is obtained by analysing samples in these compart- 
ments, such as plasma or urine — which are the two most widely used 
clinical diagnostic fluids (Fig. 2) — or tissue. This analysis leads to the 
generation ofa series of static snapshots of metabolic activity that can be 
difficult to interpret in isolation, unless there is overt metabolic disease, 
because of the background presence of physiological variability. Obtain- 
ing time series of samples from individuals who are undergoing diag- 
nostic or prognostic evaluation, or at different stages of a disease process 
allows a longitudinal metabolic pattern or trajectory to emerge that carries 
much more information on site, severity and — potentially — mechanism 
of damage’. Similar arguments can be applied to responses to therapy. 
However, in reality, only a relatively small number of tissue or fluid types 
can be sampled, and the exhaustive analysis of these samples by advanced 
metabolic and spectroscopic techniques (Box 1) still gives only ‘islands of 
information that represent local activities (tissue or specialized biofluids) 
(Fig. 2) or systemic activities that affect the extracellular environment 
(urine and plasma). Thus, one of the challenges of metabolism-based ‘top- 
down systems biology’’ is to try to build mathematical bridges between 
these islands to create system-level models that, in turn, can be used to 
generate biochemical or medical hypotheses for further testing using 
‘bottom-up’ systems-biology methods. Furthermore, urine and plasma 
samples carry very different information sets on various molecules and 
pathways, representing numerous systemic timescales. For example, data 
from plasma provide a description of the metabolic system at the time of 
sampling, although persistent alterations induced by dietary or chronic 
interventions may also be detected. By contrast, information from urine 
is time-averaged because ofits collection and storage in the bladder. There 
are also multiple complex physicochemical interactions and differences 


REIMS-MS iKnife 


public health-care epidemiology and identifying new biomarkers as well as 
surgical risk stratification and pre-operative optimization involve the analysis 
of hundreds or thousands of samples from different populations, the transfer 
of information can cause bottlenecks to data processing and total-cohort 
analysis. In the critical care or surgical setting, large-scale population studies 
can also be used to identify populations at risk of surgical morbidity or a 

poor outcome. GC, gas chromatography; iKnife, intelligent knife; MALDI- 
TOF-MS, matrix-assisted laser desorption ionization time-of-flight mass 
spectrometry; MAS-NMR, magic-angle-spinning—nuclear-magnetic- 
resonance; MS, mass spectrometry; REIMS, rapid evaporative ionization mass 
spectrometry; UPLC, ultraperformance liquid chromatography. 


in analytical matrix properties that not only determine how samples 
need to be prepared and analysed, but also carry other types of dynamic 
diagnostic information that is not demonstrated by simple (molecular 
identity and concentration) compositional analysis. Uniquely, NMR 
spectroscopic approaches do not disturb these complex perturbations in 
dynamic physicochemical interactions between molecules in biofluids, 
whereas this occurs of necessity in mass spectrometry. However, from a 
diagnostic point of view, dynamic chemical features have received rela- 
tively little attention in comparison with purely compositional biomarker 
analysis, despite the fact that some biofluids (such as semen) are highly 
reactive post collection due to intrinsic enzymatic activities’. In terms of 
biological data generation, metabonomic and related methods are highly 
complementary to other ‘omics’ tools such as genomics, metagenomics, 
proteomics and transcriptomics, each of which covers different aspects 
of systemic and cellular-activity space, and all of which are interrelated. 
Systems-biology approaches also seek to integrate these data sets by using 
appropriate multivariate statistical and network modelling to obtain a 
more holistic view of human disease, although in the clinical environment 
multiple omics screening has, to date, rarely been feasible. No one tool, 
metric or platform gives a complete biological picture of a condition, and 
all approaches generate hypotheses that need rigorous testing and valida- 
tion in the field. An advantage to metabolic profiling is that the relatively 
low cost per assay or procedure means it lends itself to large-scale testing 
and, in a clinical setting, useful information from commonly available 
samples (such as urine and plasma) can be obtained. 


Overview of clinical applications 
Metabotyping approaches have been used widely in animal models of 


disease, drug toxicity and drug action, resulting in many advances in the 
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Technology 


NMR spectroscopy and mass spectrometry are the main techniques 
that are used for the metabolic profiling of biofluids (for example, 
urine, blood plasma, amniotic fluid and cerebrospinal fluid) and 
tissues in the form of extracts or intact biopsies***°. New solid-state 
NMR methods allow analysis of samples of less than 1 mg, thereby 
allowing detailed studies of tissue heterogeneity, such as between 
tumour tissue and tumour margins”. 

Both NMR and mass spectrometry can simultaneously identify and 
quantify information on a wide range of small molecules with good 
analytical precision and accuracy, and require only a small amount of 
sample (typically 10-400 ul). NMR spectroscopy is highly reproducible, 
with a detection limit in the sub-micromolar range. All hydrogen- 
containing metabolites in a biofluid are detected simultaneously and 
non-destructively with little sample preparation. Mass spectrometry has 
much lower detection limits, but it is destructive and a more targeted 
approach is often needed with prior separation of metabolites, using 
either chromatography or capillary electrophoresis?’. Thus, mass- 
spectrometry approaches tend to be less reproducible, more platform- 
dependent and susceptible to variability. Some of the earliest clinical 
studies used gas-chromatography—mass-spectrometry, especially for 
detection of inborn errors of metabolism’, and, although still widely 
used, the requirement for chemical derivation of the sample to allow 
metabolite volatilization imposes limitations on its widespread use in 
clinical diagnostics. Liquid chromatography, particularly ultra-high- 
performance liquid chromatography (UPLC), is being used increasingly 
for metabolic profiling. 

Chemometrics — multivariate statistics applied to chemical 
data — are used in clinical metabonomics to reduce the dimensionality 
of complex spectroscopic data sets, and to identify biochemical patterns 
that relate to a disease or an intervention. Linear-projection methods, 
such as principal-components analysis and partial-least-squares 
discriminant analysis, are commonly used to map samples on the basis 
of their biochemical similarity and to extract patterns of metabolites 
that relate to a particular disease“. Principal-components analysis is 
used extensively in metabonomics. This technique transforms the data 
descriptors into a set of linear combinations of the original features 
based on decreasing levels of variance. Any clustering seen is based 
on the data alone, and there is no pre-assignment of sample classes. 
Alternatively, in ‘supervised’ methods, multiparametric data sets can 
be modelled so that the class of separate samples (a ‘validation’ set) 
can be predicted based on a series of mathematical models derived 
from the original data or ‘training’ set. Partial-least-squares analysis is a 


widely used supervised method (using a training set of data with known 
end points). This method relates a data matrix containing independent 
variables from samples, such as spectral intensity values (an X matrix) to 
a matrix containing dependent variables (for example, measurements 
of response) for those samples (a Y matrix). Partial least squares can 
also be combined with discriminant analysis to establish the optimal 
position to place a surface that best separates classes. Other popular 
chemometric methods include hierarchical clustering, self-organizing 
maps and neural networks!4°°, 

Statistical spectroscopy is a form of computational modelling 
that is used to enhance biomarker recovery, allowing improved 
information extraction from a set of spectra. This method generally 
operates on defining correlation structures between variables (signals) 
that are found to be discriminatory between sample classes. Highly 
correlated signals are likely to come from the same molecule or from 
molecules regulated by the same metabolic pathway. Statistical total 
correlation spectroscopy is used to identify correlated signals within 
a data set for biomarker identification. NMR and mass spectrometry 
possess high complementarity in molecular-structure elucidation 
studies. In many metabonomic studies, multiple samples with a 
wide range of biochemical variation are available for both NMR and 
mass-spectrometry analysis, creating an opportunity for statistical 
analysis of signal amplitude co-variation between the two sets of 
data. Statistical heterospectroscopy is an extension of statistical total 
correlation spectroscopy for the co-analysis of multispectroscopic 
data sets, which have been acquired from multiple samples. The 
statistical heterospectroscopy approach, originally developed for NUR 
and mass-spectrometry correlation, can be used if any two or more 
independent spectroscopic data sets from any source are available for 
any sample cohort’. 

Anew and exciting approach that uses mass spectrometry is the 
analysis of smoke from a cauterization device used in surgery to 
identify the exact type of tissue being investigated®"5. By using a 
combination of new sampling methods, high-speed mass spectrometry 
and chemometrics for classification purposes, it is possible to identify 
different types of tissue in real time during a surgical procedure 
a development known as the intelligent knife (i-knife). A parallel 
a 
a 
a 


pplication of mass spectrometry is to locate molecules within a sample 
s an imaging technique by using ionization methods based on laser 
blation from the tissue surface®. In combination with chemometric- 
enhanced information recovery, this has led to the possibility of an 
augmented histological assessment). 


development of modelling techniques, chemometrics and ways to identify 
new biomarkers’* >. Butas the technology and modelling platforms have 
matured and improved, there has been a shift towards the implementation 
of clinical studies. Hence, in this Review, we focus on outlining specific 
areas that have the potential to have significant impacts on translational 
medicine and clinical delivery in the hospital environment. These include 
screening patients with established diseases for the detection of new bio- 
markers to aid in clinical classification. We are now ina position to deliver 
asystems-biology framework for complex clinical problems that are often 
compromised by extreme gene-environment variation. Early studies that 
applied metabolic profiling to clinical conditions were largely focused 
on identifying biomarkers, and were typically hindered by small group 
sizes and technical constraints. However, these studies paved the way for 
the wider use of metabotyping approaches to further the understand- 
ing of systemic disease, as well as diagnostic and prognostic biomarker 
discovery. Indeed, the general principles of metabotyping approaches 
had already been well-demonstrated by the 1980s, particularly for overt 
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metabolic disease such as that seen in type 2 diabetes’* or inborn errors of 
metabolism (based on 'H-NMR-spectroscopy-derived urine and serum 
profiles)'”. It was also shown quite early on that biochemically relevant 
systemic information could be recovered, such as the increase in serum 
alanine and the reduction in branched-chain amino acids (corresponding 
to decreased amino-acid gluconeogenesis and increased ketogenesis) that 
followed insulin withdrawal in people with type 2 diabetes. In addition, 
through time modification of plasma lipid and lipoprotein profiles, thera- 
peutic optimization could also be monitored in this way’®. The field has 
expanded to encompass epidemiological and population-scale studies’®, 
and to take into account some of the complications of diabetes — such as 
vascular lesions — that lead to premature death”. Detailed cross-species 
metabolic analysis has uncovered information on the potential mecha- 
nisms underlying type 2 diabetes that relate to nucleotide metabolism, 
and to modulation of N-methylnicotinamide — which is conserved 
across rats, mice and humans”. In the case of type 1 diabetes, a metabolic 
dysregulation of lipid and amino-acid metabolism was found to precede 
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Figure 2 | Local and global metabolic interactions in relation to sampled 
compartments, fluids and their properties. In clinical settings, it is only 
possible to obtain limited ‘sampling islands’ for metabolic measurements. 
Within the body, there is a complex and dynamic continuum of metabolic 
interactions, from the subcellular level through multiple layers of 
biomolecular organization up to the whole supraorganism system, including 
the symbiotic microbiome components. At the system tissue and organ level, 
multicellular interactions occur through time and space through the secretion 
of biochemical products, as well as hormone and neurological control of 
function and physiological homeostatic regulation. Environmental and 
exogenous factors, including lifestyle, diet, drug therapy and the microbiota, 
all influence metabolism. For example, the microbiota, as part of the 
supraorganism, has a commensal and symbiotic relationship with tissues of 
the gut; the body’s interactions with pathogens and parasitic organisms, as 
well as quorum sensing, also have a role. At the intercellular level, signalling 
molecules and transporter systems coordinate functions and metabolic flux 
between cells. Finally, within the cell itself, enzymes require specific substrates 
and cofactors; biochemical conversions in organelles are topographically 
constrained, and the metabolome requires specific functional pathway units. 
The two most accessible components are urine and plasma, but they carry 
different system information sets as a result of these different regulation 


onset of the disease”, whereas a panel of five branched-chain amino acids 
was found to be predictive of type 2 diabetes”. Unsurprisingly, insulin 
resistance, and both type 1 and type 2 diabetes, have been the subject of 
intensive metabolic investigation for many years, and the contributions 
of metabonomics and metabolomics have been reviewed extensively”. 
In the field of cancer (reviewed on page 364 of this issue), most stud- 
ies were originally centred on extracts of the tumour tissue itself, and 
"H-NMR spectroscopy coupled to pattern recognition methods showed 
the ease with which discrete cancer-tissue types could be discriminated™. 
Perhaps what is now more clinically relevant is the identification of poten- 
tial cancer biomarkers in biofluids, including successful mapping of 
plasma ovarian cancer signatures — which are characterized by an altered 
pattern of ceramides and lysophospholipids, increased ketone bodies, and 
decreased alanine, valine and low-density lipoproteins~”*. Patients with 
lung cancer have also been distinguished from a control group by their 
low urinary levels of hippurate and trigonelline, together with elevated 
p-3-hydroxyisovalerate, a-hydroxyisobutyrate and N-acetylglutamine”. 
An inverse relationship between endometrial cancer and the metabo- 
lites stearic acid and serum acylcarnitines has been identified”, and 
dysregulation of acylcarnitines also has a role in kidney cancer”. Some 
metabonomic studies have found that models built on serum metabolite 
profiles perform better in terms of sensitivity than conventional markers, 
such as carcinoembryonic antigen in colorectal cancer” for predicting 
early-stage tumours (stage 0-2). Other cancer metabonomic studies — 
including those on ovarian and breast cancers*™””, and renal-cell carci- 
noma™ — have shown promise in differentiating early- from late-stage 
tumours. Perhaps even more exciting than the diagnostic potential is the 
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and control systems. Because urine is stored in the bladder, it represents 
time-averaged data and has the following physiological characteristics: 

a variable pH, ionic strength and osmolarity; a high dielectric constant; 

an extreme dynamic concentration range (more than 10"); thousands of 
molecules of less than 1 kDa; metal complexes and supramolecular aggregates; 
many small proteins; high enzyme activities in pathological states; and a 
dynamically reactive matrix. Plasma, however, provides snap-shot data and 
has the following characteristics: relatively constant pH, ionic strength and 
osmolarity, a lower bulk dielectric constant, a high dynamic concentration 
range (more than 10°), hundreds of molecules both smaller and larger than 

1 kDa; metal complexes and supramolecular complexes; a multi-compartment 
multi-diffusional matrix; and many large proteins and protein complexes. 
There are also a series of specialized secretory and pathological fluids that can 
be sampled and give, on spectroscopic analysis, more localized biochemical 
information specific to tissue injury. Specialized fluids are cerebrospinal, 
thyroid, saliva (sublingual, parotid and submaxillary), respiratory washings, 
gastric, bile, pancreatic, amniotic, follicular, milk, seminal vesicle, prostatic, 
epididymal and semen. Artificial fluids include bronchiolar lavage fluid, 
peritoneal dialysates, haemodialysates, faecal water, rectal dialysates, cell 
extracts and cell supernatants. Pathological fluids include ascites, pus, cystic 
fluid and effusions (malignant and infective). 


ability of metabolic models to predict clinical outcomes for certain can- 
cers. Micrometastases were predicted in a study of people with breast 
cancer, in which patients who went on to develop metastases were shown 
to have higher levels of plasma glucose, proline, lysine, phenylalanine 
and N-acetylcysteine and lower levels of lipids”. Similarly, a recent study 
showed that — based on pretreatment serum samples for 500 women with 
metastatic breast cancer — time to progression, overall survival and treat- 
ment toxicity could be predicted from the serum levels of phenylalanine 
and glutamate (higher) and glucose (lower) for a subset of patients who 
were HER2 positive, although correlation between pre-treatment serum 
profile and outcome was not possible for the general trial population™. 
As for diabetes, there has been an eruption of metabolic research on the 
processes of onset and progression of tumour development over the past 
decade, as well as identification of cancer biomarkers, and this has been 
comprehensively reviewed with respect to cellular biochemistry’ 5 thera- 
peutic target discovery” and tumour typing*””’. There have also been 
advances in metabotyping as a tool for basic cardiac research’ in the 
ability to predict cardiovascular events in baseline profiles of individuals 
at risk of coronary artery disease”; as well as for understanding the origins 
of pathology, including the complex environmental and non-infectious 
microbiological triggers of disease”’. For the most part, diagnostic meth- 
ods have been developed for serum and urine. However, for certain classes 
of disease — such as lung disease — gas chromatography methods for 
characterizing volatile components of exhaled breath condensate have 
shown considerable promise. For example, children with asthma and 
allergic rhinitis were distinguished from controls based on the alkane 
and aldehyde composition of breath condensate”; and differentiation 
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between each of the stages from 1 and 3 in chronic obstructive pulmo- 
nary disease identified ketones, methyl-branched alkanes and alcohols, 
in exhaled breath among other compounds”. An advantage of metabolic 
profiling is that multiple compartments and fluids can be analysed to 
give complementary information on systemic dysfunction. Thus, the 
metabolic signature of lung disease also extends to urine and serum. Uri- 
nary concentrations of the tricarboxylic-acid (TCA) cycle intermediates 
a-ketoglutarate, succinate, fumarate and cis-aconitate were found to be 
differential between people with stable and unstable asthma”, whereas 
decreased serum lipoproteins and N-dimethylglycine, and increased glu- 
tamine, 3-methylhistidine and branched-chain amino acids have been 
associated with chronic obstructive pulmonary disease™. 

Human metabolic phenotypes, and multiple disease processes, are 
highly dependent on gut-microbial activity. An emerging area for meta- 
bolic profiling is the characterization of the functional properties of the 
gut microbiome. This is the combined genomic composition (more 
than 3.3 million genes”) of several thousand species that make up the 
gut microbiota, and varies with age and between human populations”. 
Abnormalities of the gut microbiome have been associated with a remark- 
able variety of human conditions, ranging from obesity and diabetes to 
autoimmune diseases and neuropsychiatric disorders”. Metabotyping 
of inflammatory bowel diseases has been carried out on urine, plasma 
and faecal samples both to characterize the metabolic consequences of 
ulcerative colitis and Crohn’s disease and to identify disease-induced 
changes in the metabolites deriving from the gut microbiota”. Dif- 
ferences in the levels of metabolites, such as the short-chain fatty acids, 
4-cresyl sulphate and hippurate are indicative ofa perturbed microbiota 
in inflammatory bowel conditions, whereas altered levels of TCA-cycle 
intermediates and amino acids reflect a shift in energy balance. Crucially, 
the activities of the gut microbiota influence the host metabolic pheno- 
types” through a series of complex signalling axes that connect to multiple 
host compartments, including the liver and brain”, as well as the immune 
system”. The metabolic axes involve bile-acid (which are themselves 
heavily metabolized by the microbiota) dependent signalling, binding 
to a variety of nuclear receptors, such as those in the liver, which in turn 
affect host gene-expression profiles”’. There are also signalling axes that 
involve gut-microbe-generated short-chain fatty acids (from colonic fer- 
mentation of polysaccharides and oligosaccharides), aromatic amines and 
acids (from aromatic amino-acid and protein putrefaction in the distal 
colon™), as well as gut-microbe-derived links (through the endocannabi- 
noid system) which affect host adipogenesis”, and ultimately multiple 
CNS signalling axis connections. Children who are diagnosed with the 
neurobehavioural disorder autism manifest different urinary metabolite 
phenotypes compared with controls, including increased excretion of 
gut-microbial metabolites (such as phenylacetylglutamine and 4-cresyl 
sulphate) as well as altered amino-acid and nicotinic-acid profiles”. The 
microbiome influences metabolism from birth, and early events in the 
development of the microbiome-host signalling axes can leave a lasting 
metabolic imprint. Babies that are born before 37 weeks gestation have a 
higher risk of developing metabolic syndrome and end-stage renal failure 
than those born at full term. Individuals who are born preterm can still 
be differentiated from those born at full term when they reach adult- 
hood by profiling of the microbial degradation products choline, bile acids 
and acetylated glycoproteins™. These metabolic signalling axes may form 
the basis of drug discovery and other therapeutic strategies designed to 
operate either directly on the microbiome” or indirectly through interac- 
tions with host metabolic pathways and immune signalling”. Indeed, the 
microbiome may offer more druggable targets than the human genome, 
but determining this lies in future research and will be highly dependent 
on the successful application of metabotyping approaches to help eluci- 
date these complex microbial—host symbiotic interactions. 


Phenotyping patient journeys 

All patients who enter the diagnostic environment undergo a series of 
tests that is designed to characterize and stage disease, as well as to select 
suitable therapies, which then result in either a ‘good’ or ‘bad’ clinical 
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outcome for the individual — a process known as the patient journey. By 
use of the advanced metabotyping methods already described, coupled 
to classic clinical diagnostic criteria, it is now possible to conceptualize 
a phenotypically enhanced patient journey in which multiple technol- 
ogy platforms are deployed throughout the patient-handling pipeline”. 
The first level of deployment is to create enhanced diagnostic biomarker 
profiles at each stage of the journey to assess how the patient responds 
to therapies, as well as to form differential diagnoses. However, by using 
pharmacometabonomic approaches”, it is also possible to consider the 
sum of many patient journeys, and to engage in prospective or prog- 
nostic analysis of patient outcomes. In pharmacometabonomic studies, 
pre-intervention profiles of biofluids, such as urine or plasma, are used 
to create mathematical models of therapeutic interventions (using cross- 
validated models (Box 1)) so that prognostic outcomes can be judged. 
Such studies have predicted xenobiotic hepatotoxicity in experimental 
animal models through pre-intervention urinary profiling”, and pre- 
dicted drug (paracetamol) metabolism in humans using NMR-based 
spectroscopic profiling of urine (also demonstrating a complex connec- 
tion between gut microbes and drug metabolic fate)*. An example of 
a successful pharmacometabonomic patient-stratification model is the 
prediction of response to capecitabine therapy in patients with colorectal 
cancer, whereby high levels of serum polyunsaturated fatty acids were 
predictive of drug toxicity™. The approach is also relevant to understand- 
ing surgical interventional outcomes (discussed later)", and is generally 
well-suited to modelling multiple longitudinal congruent patient journeys 
and for the abstraction of patient stratification information to help inform 
decision-making. 

In particular, patient-journey phenotyping lends itself to scalable and 
translatable models that can be applied to any acute hospital admission, 
covering a variety of disease states (Fig. 3). Furthermore, this generalized 
patient-journey phenotyping protocol can be applied in a drug-devel- 
opment testing environment in which we envisage the development of a 
phenotypically augmented clinical trial. Standard clinical-trial informa- 
tion would be supplemented by molecular data, leading to a significant 
enhancement of the mechanistic background to observed responder or 
non-responder phenotypes that are seen in many clinical trials. Nowhere 
in the clinic is the ability to deliver a rapid prognostic metric of clinical 
condition more important than in the critical-care setting. Here, a gain 
in minutes or hours when choosing and implementing a therapeutic 
strategy can mean the difference between life and death, and poor deci- 
sions carry substantial financial cost. Recent studies suggest that meta- 
bolic profiling tools can augment the identification of sepsis based on a 
set of serum acylcarnitines and glycerophosphatidylcholines measured 
by liquid-chromatography-mass-spectrometry™, whereas serum levels 
of triacylglycerides, glucose and glutamate are predictive of survival in 
patients who experienced trauma”. 


Mapping metabolic phenotypes during surgery 

Surgery is, by definition, the most personalized of all therapeutic options 
delivered in current clinical practice, yet there is a lack of molecular diag- 
nostic and prognostic instruments that are required for modern preci- 
sion-based surgical practice’. As a result, clinical decisions are made on 
the basis of calculations of surgical risk using single or univariate biomark- 
ers or data from retrospective logistic regression models. Surgery also 
poses other challenges. First, clinical diagnostic phenotypes in surgery 
are often hard to define precisely and tend to be inadequate for omics- 
based research. The best example of this is found in the management 
of patients who have experienced major trauma, in whom injuries are 
highly heterogeneous and exert a variable and often unpredictable inter- 
individual effect”. Secondly, patients who undergo modern surgery are 
exposed to a large and variable environmental load of operative drugs 
and bacteria, as well as nutritional optimization strategies, which cannot 
be objectively measured by standard biochemical assays®. While surgical 
outcomes have improved, the population has aged, and patients increas- 
ingly have a more diverse range of co-morbidities, with higher associ- 
ated rates of malnutrition”, polypharmacy” and medical interventions. 
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Finally, the operating environment presents a unique and stringent set 
of requirements for molecular platforms. Not only must they be able to 
objectively quantify the large number of rapidly changing environmen- 
tal influences on patient outcome during a single operative journey, but 
they must be rapid, highly reliable, inexpensive and physically able to 
function under the conditions ofan operation. Longitudinal metabotyp- 
ing describes a metabonomic expression of an individual's metabolism, 
the trajectory of which can be accurately measured as a patient passes 
through a multivariate operative journey”. These data can be integrated 
into current clinical data sets to augment surgical decision-making, and 
deviations from a personalized trajectory can be used to detect early clini- 
cal deterioration, predict risk or guide intervention. There is now clini- 
cal evidence that this approach has merit. Mass-spectrometry-targeted 
analysis of 69 serum metabolites in a large population of patients (478 
patients) who were undergoing cardiac surgery was able to accurately 
predict a poor operative outcome over a mean follow-up period of 4.3 + 
2.4 years". Short-chain dicarboxylacylcarnitines, ketone-related metabo- 
lites and short-chain acylcarnitines were all independently predictive of 
an adverse outcome after multivariate adjustment. However, personalized 
or stratified approaches to health care must not only focus on mammalian 
biology. Initial work suggests that a metabonomic strategy is also able to 
predict early-onset systemic inflammatory response syndrome (SIRS) 
in those patients who are exposed to major trauma. Partial-least-squares 
discriminant analysis was also able to clearly distinguish between patients 
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with SIRS and multi-organ dysfunction syndrome, according to variation 
in levels of carbohydrate, amino acids, glucose, lactate, glutamine signals, 
fatty acyl chains and lipids”. Finally, metabonomics has had a particular 
impact on experimental models of kidney transplant surgery, in which it is 
being assessed for its ability to predict graft failure”, and end-organ drug 
toxicity” and to assess hypoxic injury in cadaveric specimens. It has also 
been widely used in experimental models ofliver” and gut” transplanta- 
tion. Metabonomics may have a significant impact on transplant surgery, 
in which there is a demand for rapid molecular diagnostics within the 
operating room to predict graft suitability and survival. 

The British surgeon Joseph Lister first described the importance of 
antisepsis more than 100 years ago, yet the wider role of gut bacteria in 
surgical health is only just being recognized and little is known about 
the importance of commensal bacteria to post-operative recovery. This 
is largely because clinicians rely on culture-dependent analysis to glean 
useful information about pathogenic organisms, or complex and dynamic 
ecosystems. Rapid, culture-independent analytical approaches for the 
detection of species-specific changes in pathogenic or commensal bac- 
teria are therefore of particular use in surgery. However, clinicians need 
to know not only which bacteria are present, but also what they are doing. 
Global metabotyping therefore extends beyond the model of culture- 
independent analysis — by allowing the exploration of the functional 
and symbiotic biochemical relationship between humans and microbiota 
— as patients progress through their recovery phase. For example, surgical 
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Figure 3 | Phenotyping the patient journey and phenotypically augmented 
clinical trials. Patients enter the diagnostic environment either through 
community admission, electively, as an acute case or as an emergency. At any 
point in the patient journey, there are multiple opportunities for metabolic 
phenotyping using technologies such as mass spectrometry or NMR 
spectrometry. Sections of these samples can also be stored in biobanks prior to 
analysis for use in future research. Deviation from recovery can occur at any 
point in the patient journey, and samples can be taken again. This analysis can 
be used to enhance differential diagnosis, therapeutic responses and long-term 
outcomes of therapy. Taking biosamples also provides real-time diagnosis and 
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prognosis to enhance clinical decision-making”. Before admission to hospital, 
this can be used for disease prevention and after intervention to optimize 
recovery. However, current biomarkers have left several areas of unmet 
clinical need in personalized prevention and therapeutic strategies, and in the 
delivery of sensitive and specific diagnostics and prognostic platforms for both 
surgical and medical diseases. By modelling congruent longitudinal journeys 
using pharmacometabonomic approaches” it is possible to derive prognostic 
biomarker predictors and risk phenotypes that allow patient stratification, or 
that can give mechanistic information relating to therapeutic responder or 
non-responder status. 
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bypass of the foregut is increasingly used to treat obesity and its metabolic 
complications, such as type 2 diabetes”’. There are now convincing ani- 
mal and human data to suggest that this surgical bypass fundamentally 
disrupts the distal gut microbiome, and that the subsequent disruption 
to the metabolome can be objectively measured using both NMR and 
mass-spectrometry-based approaches”*”. Following bariatric surgery, 
there is a marked dysregulation in the gut microbiota from a landscape 
dominated by Firmicutes and Bacteroidetes to one dominated by proteo- 
bacteria”**’. Metabolic profiles reflect this shift in the community with a 
persistent alteration in urinary, serum and faecal levels of cresols, indoles 
and biogenic amines””’. This suggests that personalized biomarkers for 
the prediction of long-term weight loss will have to account for the gut 
microbiome, and that the long-term health consequences of permanently 
altered enteric flow have yet to be fully defined”. The gut microbiome 
is influenced by nearly all medical peri-operative therapeutic and risk- 
reduction strategies (for example, broad-spectrum antibiotic use), yet the 
effect of this on operative morbidity is not yet understood. Therefore, the 
clinical deployment of metabonomic technologies that provide real-time 
functional insight into the human-microbiome surgical health axis dur- 
ing surgery will have a significant effect. It is likely that this will be crucial 
to older people, who are the most vulnerable — as well as the fastest grow- 
ing — group of patients who have surgery. It is well-established that stable 
age-related changes occur in gut-microbiome function®”. Recent data sug- 
gest the presence of an age-related diet-microbiome-health axis, in which 
the gut microbiome is markedly different in patients living in residential 
care than those living in the community”. This could be linked to dietary 
changes, as well as to objective measures of frailty and poor health. The 
same is probably true of patients who are subjected to long stays in hos- 
pital, suggesting that measures of gut health are urgently required, and 
that new models of pre-operative nutritional optimization are needed 
that include metabonomic measures of gut health. Moreover, surgical 
excision of the colon may have much wider metabolic consequences for 
older people than currently understood. 

Equally, there is a need for real-time interventional precision biomark- 
ers that improve the quality and efficacy of the surgical excision itself. 
Oncological surgery is still based on Halsted’s principles of oncological 
clearance, and the technique has remained largely unchanged since it 
was introduced. Clearance is typically based on arbitrary measurements 
that are not objective and are often inadequate, with potentially serious 
implications for the patient. For example, more than 20% of breast cancers 
require re-excision for positive margins*’. Metabonomics permits near 
real-time analysis with minimal sample preparation on very low volume 
samples, and several analytical platforms offer a potential solution to this 
problem. Magic-angle-spinning (MAS) 'H-NMR spectroscopy is able to 
robustly determine the difference between benign and malignant tissue 
from patients with breast and colon cancer with a high degree of sensitiv- 
ity and specificity. This has been extensively used in the analysis of brain 
tumours to differentiate between malignant tumour types”, and these 
detailed biochemical profiles can then be related to the lower-resolution 
spectra obtained in vivo using NMR spectroscopy”. This significantly 
improves MRI-based characterization of grade IV glioblastomas, metas- 
tases, medulloblastomas, lymphomas and glial tumours. Low concentra- 
tions of citrate and high concentrations of choline-containing compounds 
are metabolic characteristics that have been observed by NMR spectros- 
copy of prostate-cancer tissue. A similar approach has, therefore, been 
used in prostate cancer, in which this technique demonstrated an overall 
accuracy of between 93% and 97% for detecting the presence of prostate- 
cancer lesions”. 

Ina study* to find regulatory genes with the potential for targeted 
therapies, the gene products acetylcitrate lyase and m-aconitase were both 
found to be predictive of significantly reduced citrate level. In the same 
study, which used 133 fresh-frozen samples from 41 patients undergo- 
ing radical prostatectomy, the two genes whose expression most closely 
accompanied the increase in choline-containing compounds were 
PLA2G7and CHKA. Thus, MAS-NMR spectroscopy, when incorporated 
into a systems-level analysis, provides new insight into cancer disease 
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mechanisms, and it is conceivable that MAS-NMR spectroscopy can be 
performed within a 10-20-minute time frame if facilities are co-located 
near to, or within, clinical environments. Therefore, this analysis has 
translational potential as a clinical resource for rapid diagnostics within 
either the outpatient clinic or the operating room environment. However, 
few technologies exist that are significantly faster than frozen-section his- 
topathological analysis for the intra-operative assessment of tumour mar- 
gins. Mass spectrometry has been used to characterize intact biological 
tissues for more than 30 years, but the field gained real momentum in the 
late 1990s with the advent of matrix-assisted laser desorption/ionization 
(MALDI) imaging analysis of histological tissue sections. Mass-spectrom- 
etry imaging studies, including MALDI, have revealed the molecular fin- 
gerprint of tissues with metabolic constituents, that lipids and proteins 
have a high histological specificity, and identified a number of prognostic 
markers (both single and complex). Mass spectrometry imaging has been 
suggested as an alternative to frozen-section histology; however, the time 
demand of this type of analysis is currently several hours per sample, 
even at coarse (100 um) resolution”. Multivariate, chemically augmented 
histology of this type therefore has two significant benefits over current 
histological staining methodologies. First, it provides instantaneous tissue 
identification, which allows interactive and feedback-controlled surgical 
and diagnostic interventions. Second, there is no inter-operator reproduc- 
ibility of histological data, which can be exacerbated by low quality (for 
example, smeared) histological sections. 

Incontrast to mass-spectrometry imaging, rapid evaporative ionization 
mass spectrometry (REIMS) was developed exclusively for in situ analysis 
— even for the in vivo chemical characterization of tissues”. REIMS was 
developed as a result of the discovery that all surgical instruments that use 
thermal evaporation approaches (including electrosurgery, laser surgery, 
radiofrequency ablation and microwave ablation) ionize the molecular 
constituents of biological tissues. The subsequent combination of surgical 
instruments with mass spectrometry has yielded an approach capable of 
identifying tissues and their pathological subtypes during surgical or diag- 
nostic interventions”. Data generated from REIMS is strikingly similar 
to other imaging mass spectrometry (MALDI or desorption electrospray 
ionization) data”. Although REIMS is a more suitable technology for 
the surgical environment than MALDI imaging, the latter guarantees a 
high histological specificity with quantitative, and potentially automated, 
histopathological analysis of tissue specimens. Recent results suggest that 
the REIMS technology can be successfully implemented in the surgical 
environment”. Although the spatial resolution of REIMS is limited by 
the hand-held nature of the probe and its geometry, 100 um is generally 
achievable with rapid (less than 0.9 s) feedback to the operator. Initial anal- 
ysis of adenocarcinoma of the gastrointestinal system and lungs together 
with hepatic metastases, primary tumours of the liver, pre-cancerous 
degenerations of colon mucosa and sentinel lymph-node mapping has 
shown more than 95% concordance with classical histology and a less than 
1% false-negative rate”. The technology has also been successfully trialled 
in neurosurgical brain-tumour excision of astrocytomas, meningiomas 
and metastatic brain tumours, as well as healthy brain tissue, with simi- 
lar sensitivity and specificity’. Thus, mass-spectrometry metabonomics 
provides real-time, descriptive in vivo data that are directly comparable 
with post-interventional histological analysis. 


Concluding remarks 

Systems-biology approaches have allowed a deeper understanding 
of the metabolic and physiological function of the human symbiotic 
‘supraorganism (which includes the sum total of the eukaryotic 
and prokaryotic genomes required for human health). Disorders 
of supraorganism function underlie the aetiology of many modern 
non-communicable diseases, and metabolism-based mechanistic 
understanding of these processes therefore has much to offer personalized 
health-care systems of the future. Super-system surgery is the influence 
of surgical interventions or trauma on this complex symbiotic network, 
and the resultant time-dependent disruption in microbial-mammalian 
co-metabolic pathways. To take full advantage of theories such as this, 
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new systems-medicine technologies for surgical biomarker and drug 
target discovery are required, and these are already being developed. 
Metabonomics is used to probe the real-world nature of biochemical 
functionality and is sensitive to both gene and environmental influences; 
it is, therefore, likely to be more practical than gene-based measurements 
of responses to therapy. Nevertheless, a multi-omics approach can provide 
more information than a single one. Thus, it is currently possible to 
integrate heterogeneous data sources (for example, metagenomic and 
metabonomic, transcriptomic or proteomic data sets) to provide a 
complete top-down overview of complex disease states. We affirm that 
metabonomics has the power to influence clinical decision-making 
in the hospital environment for both medical and surgical treatments. 
The exquisite sensitivity of metabolic profiles to different diseases and 
treatment options means that computer models can be generated to 
aid decision-making processes for the medical practitioner. Moreover, 
by recording a patient’s metabotype as treatment progresses, it will be 
possible to monitor the beneficial or detrimental effects of treatment, 
so that, for example, drug regimes or diet can be altered and a prognosis 
of disease outcome can be made. However, future systems must be 
able to link omics-level data sets and clinical databases seamlessly, and 
incorporating electronic health records into experimental data sets would 
seem to be an essential, although formidable, task for the future. = 
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Analyses of pig genomes provide insight 
into porcine demography and evolution 
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For 10,000 years pigs and humans have shared a close and complex relationship. From domestication to modern 
breeding practices, humans have shaped the genomes of domestic pigs. Here we present the assembly and analysis of 
the genome sequence of a female domestic Duroc pig (Sus scrofa) and a comparison with the genomes of wild and 
domestic pigs from Europe and Asia. Wild pigs emerged in South East Asia and subsequently spread across Eurasia. Our 
results reveal a deep phylogenetic split between European and Asian wild boars ~1 million years ago, and a selective 
sweep analysis indicates selection on genes involved in RNA processing and regulation. Genes associated with immune 
response and olfaction exhibit fast evolution. Pigs have the largest repertoire of functional olfactory receptor genes, 
reflecting the importance of smell in this scavenging animal. The pig genome sequence provides an important resource 
for further improvements of this important livestock species, and our identification of many putative disease-causing 
variants extends the potential of the pig as a biomedical model. 


The domestic pig (Sus scrofa) is a eutherian mammal and a member of 
the Cetartiodactyla order, a clade distinct from rodent and primates, 
that last shared a common ancestor with humans between 79 and 
97 million years (Myr) ago’* (http://www.timetree.net). Molecular 
genetic evidence indicates that Sus scrofa emerged in South East Asia 
during the climatic fluctuations of the early Pliocene 5.3-3.5 Myr ago. 
Then, beginning ~ 10,000 years ago, pigs were domesticated in multi- 
ple locations across Eurasia* (Frantz, L. A. F. et al., manuscript 
submitted). 

Here we provide a high-quality draft pig genome sequence developed 
under the auspices of the Swine Genome Sequencing Consortium*”, 
established using bacterial artificial chromosome (BAC)° and whole- 
genome shotgun (WGS) sequences (see Methods and Supplementary 
Information). The assembly (Sscrofal0.2) comprises 2.60 gigabases 
(Gb) assigned to chromosomes with a further 212 megabases (Mb) in 
unplaced scaffolds (Table 1 and Supplementary Tables 1-3). 


Genome annotation 


A de novo repeat discovery and annotation strategy (Supplemen- 
tary Fig. 8) revealed a total of 95 novel repeat families, including: 
5 long interspersed elements (LINEs), 6 short interspersed elements 
(SINEs), 8 satellites and 76 long terminal repeats (LTRs). The relative 
content of repetitive elements (~40%, Supplementary Figs 9 and 10) 
is lower than reported for other mammalian genomes. The main 
repetitive element groups are the LINE1 and glutamic acid transfer 
RNA (tRNAS")-derived SINEs or PRE (porcine repetitive element). 
The expansion of PRE is specific to the porcine lineage. Phylogenetic 
analysis of LINE] and PRE (Supplementary Figs 13 and 14) indicates 
that only a single lineage of each is currently active and that the main 
expansion of both LINE1 and PRE occurred in the first half of 
the Tertiary period. Smaller expansions, particularly in LINE1, have 
occurred since, but recent activity is very low (Supplementary 
Information). 

Annotation of genes, transcripts and predictions of orthologues 
and paralogues was performed using the Ensembl analysis pipeline’ 
(Table 1 and Supplementary Figs 3-7). Further annotation for 
non-protein-coding RNAs (ncRNAs) was undertaken with another 
analysis pipeline (Supplementary Information and Supplementary 
Table 4). 


Evolution of the porcine genome 
Evolution of genes and gene families 
To examine the mutation rate and type of protein-coding genes that 
show accelerated evolution in pigs, we identified ~9,000 as 1:1 ortho- 
logues within a group of six mammals (human, mouse, dog, horse, 
cow and pig). This orthologous gene set was used to identify proteins 
that show accelerated evolution in each of these six mammalian 
lineages (Supplementary Information). The observed number of syn- 
onymous substitutions per synonymous site (dS) for the pig lineage 
(0.160) is similar to that of the other mammals (0.138-0.201) except 
for the mouse (0.458), indicating similar evolutionary rates in pigs and 
other mammals. The observed dN/dS ratio (ratio of the rate of non- 
synonymous substitutions to the rate of synonymous substitutions) of 
0.144 is between those of humans (0.163) and mice (0.116), indicating 
an intermediate level of purifying selection pressure in the pig. Genes 
showing increased dN/dS ratios in each lineage were analysed using 
DAVID* to examine whether these rapidly evolving genes were enriched 
for specific biological processes. Most lineages show different fast- 
evolving pathways, but some pathways are shared (Fig. 1). 
Immune genes are known to be actively evolving in mammals 
Because many immune genes were not included in the analysis 
of 1:1 orthologues, we examined a randomly selected subset of 158 
immunity-related pig proteins for evidence of accelerated evolution 
(Supplementary information). Twenty-seven of these genes (17%) 


9,10 


Table 1 | Assembly and annotation statistics 

Assembly Placed Unplaced Annotation* 

Total length 2,596,639,456 211,869,922 21,640 protein-coding 
genes 

Ungapped length 2,323,671,356 195,490,322 380 pseudogenes 

Scaffolds 5,343 4,562 2,965 ncRNAst 

Contigs 73,524 168,358 197,675 gene exons 

Scaffold N50 637,332 98,022 26,487 gene transcripts 

Contig N50 80,720 2,423 


* Numbers refer to the annotation performed by Ensembl (release 67). Results of an independent 
annotation by the NCBI can be obtained from http://www.ncbi.nIm.nih.gov/mapview/stats/ 
BuildStats.cgi?taxid=9823&build=4&ver=1. 

+ An improved ncRNA annotation with 3,601 ncRNAs and structured elements is available as a separate 
track in Ensembl version 70 and for download from http://rth.dk/resources/rnannotator/susscr102. 
N50, 50% of the genome is in fragments of this length or longer. 
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Figure 1 | Phylogeny of the six mammals used in the dN/dS analysis. KEGG 
pathways with genes that show accelerated evolution for each of the six 
mammals used in the dN/dS analysis. The bar charts show the individual dN/dS 
and dS values for each of the six mammals. The dN/dS and dS values refer to the 
time period of each of the six individual lineages. The number of proteins that 
show significantly accelerated dN/dS ratios in each lineage varies from 84 in the 
mouse to 311 in the pig lineage. Pathways significantly (P < 0.05) enriched 
within this group of genes are also shown with the number of genes shown in 
brackets. HPI, Helicobacter pylori infection. 


demonstrated accelerated evolution (Supplementary Table 8). A 
parallel analysis of 143 human and 145 bovine orthologues revealed 
very similar rates of evolution (18% in human and 12% in cattle, 
respectively). Using a branch-site analysis, we detected accelerated 
evolution of amino acids in PRSS12, CD1D and TRAF3 specific to 
pig (positive selection on pig branch), as well as amino acids in 
TREM1, IL1B and SCARAS specific to pig and cow (positive selection 
on the cetartiodactyl branch). 

Further analysis of porcine immune genes (Supplementary Table 5) 
revealed evidence for specific gene duplications and gene-family 
expansions (Supplementary Tables 6 and 7). The analysis of this second 
cetartiodactyl genome indicates that some expansions are cetartiodactyl- 
specific (cathelicidin) whereas others are ruminant/bovine-specific 
(B-defensins, C-type lyzozymes) or potentially porcine-specific (type 
I interferon, 6 subfamily). 

Pigs have at least 39 type I interferon (IFN) genes, which is twice the 
number identified in humans and significantly more than in mice. We 
also detected 16 pseudogenes in this family. Cattle have 51 type I IFNs 
(13 pseudogenes), indicating that both bovine and porcine type I IFN 
families have undergone expansion. This is particularly important 
for interferon subtypes 6 (IFND), @ (IFNW) and t (IFNT); pigs 
and cattle are evolving species-specific subtypes of IFND and IFNT, 
respectively. Both species are expanding the IFNW family and share 
many more IFNW isoforms than other species. Thus, expansion of 
interferon genes is not ruminant-specific as proposed earlier’®, although 
duplication within some specific sub-families seems to be either bovine- 
or porcine-specific. 

Within the immunity-related genes annotated, we found evidence 
for duplication of six immune-related genes: IL1B, CD36, CD68, 
CD163, CRP and IFIT1, and one non-immune gene, RDH16. The 
CD36 gene is also duplicated in the bovine genome, whereas the 
IL1B gene duplication, where evidence for a partial duplication was 
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reported previously"', is unique in mammals. Other key immune 
genes in the major histocompatibility complex, immunoglobulin, 
T-cell-receptor and natural killer cell receptor loci have been charac- 
terized in detail'*””? (Supplementary Information). 

Another significant porcine genome expansion is the olfactory 
receptor gene family. We identified 1,301 porcine olfactory receptor 
genes and 343 partial olfactory receptor genes”. The fraction of pseu- 
dogenes within these olfactory receptor sequences (14%) is the lowest 
observed in any species so far. This large number of functional olfac- 
tory receptor genes most probably reflects the strong reliance of pigs 
on their sense of smell while scavenging for food. 


Conservation of synteny and evolutionary breakpoints 
Alignment of the porcine genome against seven other mammalian 
genomes (Supplementary Information) identified homologous syn- 
teny blocks (HSBs). Using porcine HSBs and stringent filtering cri- 
teria, 192 pig-specific evolutionary breakpoint regions (EBRs) were 
located. The number of porcine EBRs (146, Supplementary Table 11 
and Supplementary Fig. 16) is comparable to the number of bovine- 
lineage-specific EBRs (100) reported earlier using a slightly lower 
resolution (500 kilobases (kb)), indicating that both lineages evolved 
with an average rate of ~2.1 large-scale rearrangements per million 
years after the divergence from a common cetartiodactyl ancestor 
~60 Myr ago’. This rate compares to ~ 1.9 rearrangements per million 
years within the primate lineage (Supplementary Table 11). A total of 
20 and 18 cetartiodactyl EBRs (shared by pigs and cattle) were detected 
using the pig and human genomes as a reference, respectively. 

Pig-specific EBRs were enriched for LTR endogenous retrovirus 1 
(LTR-ERV1) transposons and satellite repeats (Supplementary 
Table 12), indicating that these two families of repetitive sequences 
have contributed to chromosomal evolution in the pig lineage. Dif- 
ferent families of transposable elements seem to have been active in 
the cetartiodactyl ancestor. The cetartiodactyl EBRs are enriched for 
LINE1 elements and tRNA“"-derived SINEs. tRNA“'-derived 
SINEs, previously found over-represented in cetartiodactyl EBRs 
defined in the bovine genome”, originated in the common ancestor of 
cetartiodactyls*’. Our observation that these elements are also enriched 
in porcine EBRs strongly supports the hypothesis that active transpos- 
able elements promote lineage-specific genomic rearrangements. 

A stringent set of porcine to human one-to-one orthologues using 
the MetaCore database revealed that porcine EBRs and adjacent inter- 
vals are enriched for genes involved in sensory perception of taste 
(P<8.9 X 10 °; FDR <0.05), indicating that taste phenotypes may 
have been affected by events associated with genomic rearrangements. 
Pigs have a limited ability to taste NaCl’*. SCNN1B, a gene encoding a 
sodium channel involved in the perception of salty tastes, is located in 
a porcine-specific EBR. Another gene, ITPR3, encoding a receptor for 
inositol triphosphate and a calcium channel involved in the percep- 
tion of umami and sweet tastes, has been affected by the insertion of 
several porcine-specific SINE mobile elements into its 3’ untranslated 
region (3’ UTR), consistent with our observation of a higher density 
of transposable elements in EBRs. In addition to 8 bitter taste receptor 
genes annotated by Ensembl and which were used in the gene enrich- 
ment analysis, we identified 9 intact genes, to give a total number of 17 
TAS2R receptors in the pig (Supplementary Table 13). This compares 
to 18 intact bitter taste receptors in cattle, 19 in horse, 15 in dog and 25 
in humans”. Of the 14 bitter taste receptor genes that were mapped 
to a specific pig chromosome (SSC), 10 were found near 2 EBRs on 
SSC5 and SSC18 (Supplementary Tables 13 and 15). We also found 
that at least four taste receptors (TASIR2, TAS2R1, TAS2R40 and 
TAS2R39) have been under relaxed selection (Supplementary Infor- 
mation). Pigs are not sensitive to bitter tastes and tolerate higher 
concentrations of bitter compounds than humans*”*. Thus, pigs 
can eat food that is unpalatable to humans. A review of the porcine 
taste transduction network (Supplementary Fig. 17) revealed addi- 
tional genes affected by rearrangements that affect ‘apical and taste 
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receptor cell’ processes. Together with the observed over-representa- 
tion of genes related to ‘adrenergic receptor activity’ and ‘angiotensin 
and other binding’ categories in the pig EBRs (Supplementary 
Fig. 18), our data indicate that chromosomal rearrangements signifi- 
cantly contributed to adaptation in the suid lineage. 


Population divergence and domestication 

Divergence between Asian and European wild boar 

We investigated the evolution within Sus scrofa in Eurasia by sequen- 
cing ten individual unrelated wild boars from different geographical 
areas. In total, 17,210,760 single nucleotide polymorphisms (SNPs) 
were identified among these ten wild boars. The number of SNPs 
segregating in the four Asian wild boars (11,472,192) was much higher 
than that observed in the six European wild boars (6,407,224) with only 
2,212,288 shared SNPs. This higher nucleotide diversity was visible in 
the distribution of heterozygous sites of the Asian compared to the 
European wild boar genomes (Fig. 2). Phylogenomic analyses of com- 
plete genome sequences from these wild boars and six domestic pigs 
revealed distinct Asian and European lineages (Supplementary Fig. 23) 
that split during the mid-Pleistocene 1.6-0.8 Myr ago (Calabrian stage, 
Frantz, L. A. F. et al., manuscript submitted). Colder climates during 
the Calabrian glacial intervals probably triggered isolation of popula- 
tions across Eurasia. Admixture analyses (Supplementary Infor- 
mation) within Eurasian Sus scrofa disclosed gene flow between the 
northern Chinese and European populations consistent with pig migra- 
tion across Eurasia, between Europe and northern China, throughout 
the Pleistocene. Our demographic analysis on the whole-genome 
sequences of European and Asian wild boars (Fig. 3) revealed an 
increase in the European population after pigs arrived from China. 
During the Last Glacial Maximum (LGM; ~20,000 years ago)”®, how- 
ever, Asian and European populations both suffered population bottle- 
necks. The drop in population size was more pronounced in Europe 
than Asia (Fig. 3), suggesting a greater impact of the LGM in northern 
European regions and probably resulting in the observed lower genetic 
diversity in modern European wild boar. 

The deep phylogenetic split between European and Asian wild 
boars is further supported by our observation of 1,272,737 fixed dif- 
ferences between the six European and four Asian wild boars, 1,706 of 
which are non-synonymous mutations in 1,191 different genes. Genes 
involved in sensory perception, immunity and host defence were 
among the most rapidly evolving genes (Supplementary Table 28), 
further strengthening the conclusions from our analysis of immunity- 
related pig proteins. This conclusion is further supported by our 
observation that these genes are also enriched in porcine segmental 
duplications (Supplementary Information). 
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Figure 2 | Distribution of heterozygosity for individual pig genomes. 
Shown is the distribution of the heterozygosity as the log,(SNPs) per 10k bin. 
a, Wild Sus scrofa: blue, south China; green, north China; orange, Italian; red, 
Dutch. b, Breeds: blue, Chinese breeds (Jiangquhai, Meishan, Xiang); red- 
yellow, European breeds (Hampshire, large white, landrace). Note that the 
Hampshire breed is a North American breed of European origin. 
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Figure 3 | Demographic history of wild boars. Demographic history was 
inferred using a hidden Markov model (HMM) approach as implemented in 
pairwise sequentially Markovian coalescence (PSMC)*. In the absence of 
known mutation rates for pig, we used the default mutation rate for human (1) 
of 2.5 X 10” *. For the generation time (g) we used an estimate of 5 years. The 
Last Glacial Maximum (LGM) is highlighted in grey. WB,), wild boar 
Netherlands; WB;,, wild boar Italy; WByq, wild boar north China; WBs.y, wild 
boar south China. 


To investigate further whether specific regions in the genome of 
European and Asian wild boar have been under positive selection, a 
selective sweep analysis was performed on the ten wild boar genome 
sequences using an approach similar to that recently described in the 
comparison of Neanderthal and Homo sapiens genomes”’. Regions in 
the genome under strong positive selection after the divergence of 
these two populations are expected to share fewer derived alleles. 
Using stringent criteria (Supplementary Information), we identified 
a total of 251 putative selective sweep regions, with an average size 
of 111,269 base pairs (bp), together comprising around 1% of the 
genome and harbouring 365 annotated protein-coding genes (Sup- 
plementary Table 26). Many of these regions (112) do not contain any 
currently annotated protein-coding exons. In contrast, the 10 putative 
selective sweep regions located between positions 39-43 Mb on SSC3 
together harbour 93 genes. This SSC3 region (Supplementary Fig. 25) 
is directly adjacent to the centromere and exhibits low recombination 
rates**, Low recombining regions have been shown to be more prone 
to selective sweeps and meiotic drive”””°. Although similar large puta- 
tive selective sweep regions close to the centromere were only 
observed on SSC6 between positions 56.2-57.5 Mb, on most chromo- 
somes selective sweep regions tended to cluster in the central part of 
chromosomes, thus exhibiting a clear correlation with regions of low 
recombination (Supplementary Fig. 27). As expected, regions with the 
highest nucleotide differentiation between European and Asian wild 
boars were observed in high recombination regions towards the end of 
the chromosomes on both metacentric and acrocentric chromosomes”. 

The putative selective sweep regions displayed significant over-rep- 
resentation of genes involved in RNA splicing and RNA processing, 
indicating possible changes in the regulation of genes at the level of 
RNA processing (Supplementary Table 27). Several of these genes 
(CELF1, CELF6, WDR83, RBM39, RBM6, HNRNPA1, HNRNPM) 
are involved in alternative splicing, and, small differences in expression 
might affect alternatively spliced transcripts of specific genes. Evolution 
of regulatory splicing factors such as heterogeneous ribonucleoprotein 
particle (hnRNP) proteins has been proposed as an evolutionary 
model for alternative splicing*', and genetic variation in such factors 
can affect alternative splicing and result in different phenotypes or 
disease**. Our observation that specific genes involved in splicing show 
accelerated evolution in the pig lineage (Fig. 1) supports this hypo- 
thesis. Of particular interest is the selective sweep region observed at 
position 26 Mb on SSC3 around the ERI2 gene (Fig. 4), which encodes 
ERI exoribonuclease family member 2. Different gene variants have 
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Figure 4 | Putative selective sweep region around the ERI2 gene on SSC3. 
The y axis shows the log-transformed value of the ratio for the observed/ 
expected derived allele frequency using a sliding window at a bin size of 
50,000 bp. The x axis shows the position on SSC3 in base pairs. 


been fixed in European and Asian wild boar coding for proteins that 
differ at two amino acid positions: Cys52Arg and His358Leu encoded 
by exons 3 and 9 of the ERI2 gene, respectively. The precise function of 
ERI2 is unknown but the ERI1 exoribonuclease family members have 
been shown to be involved in mRNA decay’ and in Caenorhabditis 
elegans ERI-1 has been shown to be involved in the degradation of 
microRNAs (miRNAs)**. 


Independent domestication and admixture events in domestic 
breeds 

A phylogenetic tree constructed using four European wild boar and 
domestic pigs and six East Asian wild boar and domestic pigs revealed 
a clear distinction between European and Asian breeds, thus substan- 
tiating the hypothesis that pigs were independently domesticated 
in western Eurasia and East Asia’. An admixture analysis revealed 
European influence in Asian breeds, and a ~35% Asian fraction in 
European breeds (Supplementary Table 24). These results are con- 
sistent with the known exchange of genetic material between 
European and Asian pig breeds**. We also observed that European 
breeds form a paraphyletic clade, which cannot be solely explained by 
varying degrees of Asian admixture (Supplementary Information). 
Within each continent, our analysis revealed different degrees of related- 
ness between breeds and their respective wild relatives (Supplementary 
Table 20). 

During domestication, pigs were often allowed to roam in a semi- 
managed state and recurrent admixture between wild and domesti- 
cated individuals was not uncommon, especially in Europe*’. Thus, the 
most likely explanation for the paraphyletic pattern seen in domestic 
individuals is a long history of genetic exchange between wild and 
domestic pigs. 


The pig as a biomedical model 

The pig is an important biomedical model and the ability to generate 
transgenics and knockouts in combination with somatic nuclear cloning 
procedures has resulted in a number of models for specific human 
diseases**. Naturally occurring mutations also offer opportunities to 
use pigs as biomedical models*”**. To explore the potential for natural 
models further, predicted porcine protein sequences were compared 
with their human orthologues. We observed 112 positions where the 
porcine protein has the same amino acid that is implicated in a human 
disease (Supplementary Table 29). Most of these changes in humans 
have been shown to increase risk in multifactorial traits such as obe- 
sity (ADRB3, SDC3) and diabetes (PPPIRA, SLC30A8, ZNF615) or 
shown to result in relatively mild phenotypes (for example, dyslexia: 
KIAA0319) or late-onset diseases such as Parkinson’s disease (LRRK2, 
SNCA) and Alzheimer’s disease (TUBD1, BLMH, CEP192, PLAU). 
These porcine variants are of interest, as they will allow detailed 
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characterization in an experimental model organism whose physi- 
ology is very similar to that of human. 

Among 32,548 non-synonymous mutations identified by sequen- 
cing 48 individual pigs, representing 8 different European and Asian 
breeds and wild boars*’, 6 protein variants implicated in human 
disease were identified (Supplementary Table 30). In addition, 
another 157 nonsense mutations in 142 genes were identified, 11 of 
which have also been implicated in human disease (Supplementary 
Table 31). Most of these 11 variants were only observed in a hetero- 
zygous state and those for which homozygous individuals were 
observed probably result in either a mild phenotype (ASS1, mild form 
of citrullinaemia in humans) or in phenotypes unlikely to affect the 
fitness of wild boars (RBBP8, pancreatic carcinomas). Our estimate 
for the average number of nonsense mutations per individual (~30) is 
smaller than that observed in humans* despite the observed threefold 
higher nucleotide diversity in pigs”. This is in agreement with the 
higher effective population size in the pig compared to that for the 
human population, which exhibited a strong bottleneck followed by 
an exponential increase in size during recent history*'. 

When considering pig-to-human xenotransplantation, porcine 
endogenous retroviruses (PERVs) pose a risk of zoonotic infection. 
The pig genome contains fewer endogenous retroviruses than 
many vertebrates, including humans and mice, and most PERVs were 
characterized as defective. However, the potential risk posed by re- 
activation of rare replication-competent PERVs and defective PERVs 
by recombination remains, as shown for murine ERVs (XMRV)”. 
Most PERVs consist of y and y-like groups (68%), with B-retroviral 
ERVs comprising the second most abundant group (Supplementary 
Fig. 15). Our phylogenetic study shows a particularly close relation- 
ship between the most intact yl group of PERVs (v1) and murine 
y-ERVs, suggesting a potential recent instance of murine-to-porcine 
transmission of yl ERVs (Supplementary Fig. 15). We identified 
20 almost intact PERV y1 loci (Supplementary Table 10), none of 
which contained a complete set of gag, pol or env open reading frames, 
indicating that these proviruses are not replicable. We also identified 
four B-retroviral PERVs, each containing defects, primarily in env. 
These were distantly related to intracisternal type A particle (IAP) 
proviruses of mice and the mouse mammary tumour virus (MMTV)- 
like (HML) proviruses of humans. None of the above loci was shared 
in more than 120 pigs tested, indicating considerable PERV poly- 
morphisms. 


Conclusion 


The draft pig genome sequence reported here has illuminated the 
evolution of Sus scrofa and confirmed its speciation in South East 
Asia and subsequent domestication at multiple regions across 
Eurasia. The high-quality annotated reference genome sequence has 
already proven to be a critical framework for comparing individual 
genomes*”*** and its value is further illustrated in associated papers 
published elsewhere (http://www.biomedcentral.com/series/swine). 
The genome sequence also provides a valuable resource enabling 
effective uses of pigs both in agricultural production and in biomedi- 
cal research. 


METHODS SUMMARY 


Assembly. We constructed a hybrid de novo assembly based primarily on 
sequences from BAC clones sequenced clone-by-clone and supplemented with 
Illumina whole-genome shotgun (WGS) reads. BAC clones were selected from 
the high-resolution physical (BAC contig) map® with CHORI-242 library clones 
prepared from DNA from a single Duroc sow (Duroc 2-14) chosen preferentially. 
The WGS sequence data were generated using DNA isolated from the same 
animal. The BAC-derived sequence data were assembled into sequence contigs 
using Phrap on a clone-by-clone basis and subsequently independently 
assembled WGS contigs (Supplementary Information) were used to extend 
BAC clone-derived sequence contigs and to close gaps between clone-derived 
contigs. Further details and other methods are described in Supplementary 
Information. 
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Pancreatic cancer genomes reveal 
aberrations in axon guidance pathway genes 


A list of authors and their affiliations appears at the end of the paper 


Pancreatic cancer is a highly lethal malignancy with few effective therapies. We performed exome sequencing and copy 
number analysis to define genomic aberrations in a prospectively accrued clinical cohort (n = 142) of early (stage I and II) 
sporadic pancreatic ductal adenocarcinoma. Detailed analysis of 99 informative tumours identified substantial 
heterogeneity with 2,016 non-silent mutations and 1,628 copy-number variations. We define 16 significantly mutated 
genes, reaffirming known mutations (KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARIDIA and SF3B1), and uncover 
novel mutated genes including additional genes involved in chromatin modification (EPC1 and ARID2), DNA damage 
repair (ATM) and other mechanisms (ZIM2, MAP2K4, NALCN, SLC16A4 and MAGEA6). Integrative analysis with in vitro 
functional data and animal models provided supportive evidence for potential roles for these genetic aberrations in 
carcinogenesis. Pathway-based analysis of recurrently mutated genes recapitulated clustering in core signalling 
pathways in pancreatic ductal adenocarcinoma, and identified new mutated genes in each pathway. We also 
identified frequent and diverse somatic aberrations in genes described traditionally as embryonic regulators of axon 
guidance, particularly SLIT/ROBO signalling, which was also evident in murine Sleeping Beauty transposon-mediated 
somatic mutagenesis models of pancreatic cancer, providing further supportive evidence for the potential involvement 


of axon guidance genes in pancreatic carcinogenesis. 


Pancreatic cancer is the fourth leading cause of cancer death, with an 
overall 5-year survival rate of <5%, statistics that have not changed in 
almost 50 years’. Advances in neoadjuvant and adjuvant chemothera- 
peutic regimens have resulted in some improvement in outcome, but 
pancreatectomy remains the single most effective treatment modality 
for pancreatic cancer, and offers the only potential for cure. Only 20% of 
patients present with localized, non-metastatic disease which is suitable 
for resection’. Those who undergo resection and receive adjuvant 
therapy have a median survival of 12-22 months and a 5-year survival 
of 20-25%’. Existing systemic therapies are only modestly effective and 
the median survival for patients with metastatic disease remains 
6 months. Genomic characterization of pancreatic ductal adenocarci- 
noma (PDAC), which accounts for over 90% of pancreatic cancer, has 
so far focused on targeted polymerase chain reaction (PCR)-based 
exome sequencing of primary and metastatic lesions propagated as 
xenografts or cell lines*. A deeper understanding of the underlying 
molecular pathophysiology of the clinical disease is needed to advance 
the development of effective therapeutic and early detection strategies. 


Clinical cohort 


A cohort of 142 consecutive patients with primary operable, untreated 
PDAC who underwent pancreatectomy with curative intent (pre- 
operative clinical stages I and II) were recruited, and consent was 
obtained for genomic sequencing through the Australian Pancreatic 
Cancer Genome Initiative (APGI), the Baylor College of Medicine 
Pancreatic Cancer Genome Project and the Ontario Institute for 
Cancer Research Pancreatic Cancer Genome Study (ABO collabora- 
tion) between June 2005 and June 2011 as part of the International 
Cancer Genome Consortium (ICGC)°. Detailed clinico-pathological 
characteristics of the cohort demonstrated features typical of resected 
PDAC with regard to tumour size, grade, lymph node metastasis 
and survival when compared to multiple retrospectively acquired 
cohorts**, defining the accrued population as representative of 
the clinical disease in the community (Supplementary Table 1 and 
Supplementary Fig. 1). 


Cellularity and mutation detection 

A major challenge in genomic sequencing is the low malignant epithe- 
lial cell content of many cancers, which can adversely impact on the 
sensitivity of mutation detection. Most sequencing studies so far have 
used samples with >70% tumour cellularity, or cell lines/xenografts*”. 
To implement genomic sequencing approaches in clinical practice, it is 
imperative to efficiently and accurately detect actionable mutations in 
diagnostic clinical samples. We devised methodologies to overcome 
the challenges associated with extensive desmoplastic stroma that is 
characteristic of the majority of PDAC, and these strategies facilitated 
the discovery of novel molecular mechanisms in the pathophysiology 
of this disease. The cellularity of each primary sample was estimated 
through pathological review, deep amplicon-based sequencing of 
exons 2 and 3 of KRAS (average depth of 1,000), and single nucleo- 
tide polymorphism (SNP) array-based cellularity estimates using a 
novel algorithm (qpure)’®. KRAS mutations were identified in 93% 
of 142 cases and tumour cellularity ranged from 5% to 85% with a 
mean of 38% (Supplementary Table 2, Supplementary Figs 2 and 3, and 
Supplementary Methods). 

To inform cellularity thresholds for subsequent analyses, we defined 
the impact of stromal DNA content on mutation detection by exome 
capturing and sequencing different mixtures of cancer cell line and 
matched germline DNA (100%, 80%, 60%, 40%, 20% and 10% cell line 
DNA) when sequenced to a depth of 70X coverage. Using these data as 
a standard, the median sensitivity to detect true positives across all 
samples in the cohort with greater than 20% epithelial cellularity was 
estimated at 45% (Supplementary Table 3). An informative cohort of 
99 patients who had greater than 20% cellularity and/or = 10 validated 
somatic mutations was taken forward for further analysis. 


Mutation detection and CNV analysis 

We performed hybrid-selection-based capture and sequencing of the 
entire exomes of tumour and matched normal DNA derived from all 
142 patients using a combination of capture systems and next- 
generation sequencing platforms (see Supplementary Methods). The 
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Table 1 | Mutations in pancreatic ductal adenocarcinoma (n = 99) 


Mutation class Total 
Missense 1,684 
Nonsense 99 
Splice site 89 
Insertion/deletion 144 
Non-silent 2,016 
Silent 611 


sequence depths at each site (APGI 65, BCM 104X and OICR 205 xX) 
were adopted to ensure suitable sensitivity across their respective 
cohorts (Supplementary Table 3). In the informative 99 samples, we 
detected 2,627 high-confidence mutations, 2,016 of which were non- 
silent (Table 1). A total of 1,502 of these events (1,350 non-silent) were 
independently validated via an orthogonal sequencing method (see 
Supplementary Methods). The average number of mutations detected 
per patient was 26 (range 1-116), consistent with the expected sen- 
sitivity based on cellularity estimates and previous studies*" (Sup- 
plementary Table 2). We confirmed the high prevalence of genetic 
aberrations known to be important in PDAC and observed mutations 
in 38 of the 79 genes (48% overlap) that occurred more than once 
previously reported by ref. 4, and 186 of all 998 mutated genes (19% 
overlap) in that study. We also defined a large number of novel muta- 
tions (1,456 genes), most of which occurred at low frequency (see 
Supplementary Tables 4— 6 and Supplementary Fig. 4 for detailed 
comparisons). The observed transversion/transition rates in the cohort 
correlated closely with those previously reported in PDAC cell lines 
and xenografts (Supplementary Table 7). 

Significant mutated gene analysis’* of genes with non-silent muta- 
tions that occurred in 2 or more individual cancers identified 16 genes 
in the top 20 mutated genes in 2 of 3 stringent analytical approaches 
(Table 2, Supplementary Table 8 and Supplementary Methods) and 
reaffirmed the importance of mutations known to occur in PDAC: 
KRAS, TP53, CDKN2A, SMAD4, MLL3, TGFBR2, ARIDIA and 
SF3B1. Novel significantly mutated genes included additional genes 
involved in chromatin modification (EPC1 and ARID2) and ATM, 
recently implicated as a PDAC susceptibility gene through bi-allelic 
inactivation in a case of familial PDAC (germline mutation and loss 
of heterozygosity (LOH) in the tumour)’. Aberrations of ATM 
occurred in 8% of our cohort (mutated in 5%, LOH or loss in 5%, 
with two patients exhibiting both mutation and LOH or loss) and 
mutations detected in other genes not previously reported: ZIM2, 
MAP2K4, NALCN, SLC16A4 and MAGEA6 (Table 2). GISTIC2.0% 
identified 30 genes affected by copy-number alterations (Q value 
<0.0001) and included losses of CDKN2A and SMAD4 (Supplemen- 
tary Table 4). 


Pathways in pancreatic cancer 

To better understand potential underlying mechanisms of import- 
ance in PDAC, we performed a series of pathway analyses using genes 
that were recurrently mutated in two or more individuals using 
GeneGO", and identified mechanisms known to be important in cancer: 
GI1/S checkpoint machinery (P = 1.49 X 10°), apoptosis (P = 1.32 X 
10 *), regulation of angiogenesis (P = 7.72 X 10 *) and TGF-B sig- 
nalling (P= 9.50 X 10 *). Interestingly, novel gene signatures were 
enriched in our cohort, including axon guidance (P = 5.30 X 10°) 
(Supplementary Table 9). The inclusion of mutation data for 24 cases 
from ref. 4 strengthened the association of axon guidance (P= 
3.3 X 10 ”), and was more evident still when all mutated genes in our 
data set were used as input (P = 4.67 X 10 °). 


Functional relevance of genomic events 

Differentiating somatic driving events of carcinogenesis from passenger 
mutations is a major challenge in cancer genomics"®. Despite significant 
advances in computational algorithms, experimental evidence of func- 
tional relevance is paramount. We used data from three published 
experimental biological screens to infer functional consequences for 
the individual genomic events and the pathways we identified. These 
included data from two independent Sleeping Beauty transposon (SB) 
mutagenesis screens in Kras transgenic mouse models of PDAC’”"* 
and an in vitro short hairpin RNA (shRNA) screen which examined 
the consequences of downregulating 11,194 putative cancer genes on 
survival in a panel of 102 cell lines (13 pancreatic)'” (Supplementary 
Methods and Supplementary Figs 5 and 6). Data from these screens 
confirmed the functional importance of KRAS, TP53, CDKN2A and 
SMAD4 mutations and attributed potential functional relevance 
to most significantly mutated genes—MLL3, TGFBR2, SF3B1, EPC1, 
ARIDIA, ARID2, MAP2K4, ATM, NALCN, ZIM2, SLC16A4 
(Table 2)—and many genes mutated at low frequency (Supplemen- 
tary Table 4). 

Pathway analysis of high confidence insertions in SB transposon 
mutagenesis screens demonstrated enrichment for axon guidance 
genes (P = 1.6 X 10~*), providing independent supportive evidence 
for a potential role in the pathogenesis of PDAC-. In these screens, 14 
genes involved in axon guidance pathways were detected (5 genes 
common to both). In addition, a further 32 genes were mutated in 
at least one SB pancreatic tumour (out of 21) but did not meet the 
significance threshold with the stringent analyses that were applied” 
(Supplementary Tables 10 and 11). 


Axon guidance pathway genes 


The class of genes traditionally described for their roles in axon guidance 
(semaphorins, slits, netrins and ephrins) are important regulators of 


Table 2 | Significantly mutated genes in pancreatic ductal adenocarcinoma 


Gene symbol Gene name and protein function SB mutagenesis* shRNAt 
KRAS Oncogene; GTPase; activation of MAPK activity Yes Yes 
TP53 Tumour suppressor p53; DNA damage response - Yes 
CDKN2A Cyclin-dependent kinase inhibitor 2A; G1/S transition of mitotic cell cycle; tumour suppressor Yes - 
SMAD4 Mothers against decapentaplegic homologue 4; BMP signalling pathway Yes Yes 
MLL3 Myeloid/lymphoid or mixed-lineage leukaemia protein 3; DNA binding; regulation of transcription Yes Yes 
TGFBR2 Transforming growth factor-B receptor type Il; regulation of growth Yes - 
ARID1A AT-rich interactive domain-containing protein 1A; SWI/SNF complex; chromatin modification Yes Yes 
ARID2 AT-rich interactive domain-containing protein 2; chromatin modification Yes - 
EPCI Enhancer of polycomb homologue 1; histone acetylation Yes - 
ATM Ataxia telangiectasia mutated; DNA damage response - Yes 
SF3B1 Splicing factor 3B subunit 1; nuclear MRNA splicing - Yes 
ZIM2 Zinc finger imprinted 2; regulation of transcription - Yes 
MAP2k4 Dual specificity mitogen-activated protein kinase kinase 4; Toll-like receptor signalling pathway Yes Yes 
NALCN Sodium leak channel non-selective protein; sodium channel activity - Yes 
SLC16A4 Solute carrier family 16 member 4; monocarboxylate transporter - Yes 
MAGEA6 Melanoma-associated antigen 6; protein binding - ND 


ND, not determined. 
* Significant insertion sites in two independent Sleeping Beauty mutagenesis screens?”8. 
+ In vitro shRNA screens in 102 cancer cell lines with effect on cell survival?®. 
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normal neuronal migration and positioning during embryonic 
development. More recently, they have been implicated in cancer cell 
growth, survival, invasion and angiogenesis”; however, the incidence 
of aberrations in these genes in cancer is largely unknown. We iden- 
tified recurrent mutations and copy-number variations (CNVs) of 
axon guidance pathway genes in this cohort (Fig. 1 and Supplemen- 
tary Table 4): SLIT2 and ROBO2 mutations were present in 5% of 
patients, with focal copy-number losses of ROBO1, and SLIT2 detected 
by GISTIC2.0 analysis and confirmed by manual review, potentially 
having an impact on a further 15% of the cohort, suggesting that 
aberrant SLIT/ROBO signalling is potentially a common feature of 
PDAC (Figs 1 and 2). In addition, we used targeted PCR-based se- 
quencing of an additional 30 cases of PDAC for axon guidance genes 
and identified mutations in ROBO1 in two patients and additional 
mutations in SLIT2 and ROBO2 (one patient each). Low mRNA 
expression of the ROBO2 receptor was associated with poor patient 
survival (P = 0.04). Furthermore, high mRNA expression of ROBO3, a 
known inhibitor of ROBO2 signalling’, demonstrated an appropriate 
reciprocal inverse association with poor survival (P < 0.006) (Fig. 2). 

Class 3 semaphorins (SEMA3A and SEMA3E) exhibited significant 
amplification in 18% of patients and an additional 3% harboured 
mutations (Fig. 1). Semaphorins signal through neuropilin and plexin 
receptors to elicit their effects’. SEMA3A amplification correlated 
with high mRNA expression on microarray (P = 0.03), and high 
mRNA expression of SEMA3A and PLXNA1, another molecule cen- 
tral to semaphorin signalling, were both associated with poor patient 
survival on univariate analysis (Fig. 3a), and were independently pro- 
gnostic on multivariate analyses with clinico-pathological variables 
(Supplementary Table 12). 

To elucidate further the significance of the observed CNV events, we 
reviewed copy number, CNV segment size and changes in heterozyg- 
osity of axon guidance genes in a recent independent CNV analysis of 
39 fine-needle aspiration biopsies” and the 16 PDAC cell lines in the 
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CONAN database (http://www.sanger.ac.uk/cosmic)™*. Overall, the 
predominant changes recapitulated our studies, showing frequent 
focal losses within genes involved in SLIT/ROBO signalling, and gains 
in genes involved in canonical semaphorin signalling (Supplementary 
Tables 4, 13 and 14). 

To assess whether dysregulation of axon guidance genes is associated 
with early neoplastic transformation, as are many developmental sig- 
nalling pathways, we examined mRNA expression in murine models of 
early pancreatic carcinogenesis (in vitro acinar-to-ductal metaplasia 
and in vivo pancreatic injury). Expression levels of components of 
SLIT/ROBO and semaphorin signalling changed progressively from 
normal pancreas, through acinar-to-ductal metaplasia and pancreatic 
injury to genetically engineered murine PDAC, indicating a role for the 
dysregulation of these axon guidance genes in tumour initiation and 
progression (Fig. 3b and Supplementary Table 15). 


Discussion 

We devised methodologies to optimize mutation detection for 
clinical samples in a large cohort of patients and reaffirm known 
mutations in PDAC, better define their prevalence in a large cohort 
of early PDAC, and identify potential novel drivers in this disease. 
Somatic mutations in ATM were identified in a significant proportion 
of patients (8%), highlighting the importance of BRCA-mediated 
DNA damage repair mechanisms in sporadic PDAC as well as familial 
disease'*. Previously, mutations in individual genes involved in chro- 
matin remodelling such as ARID1A* have been described and addi- 
tional genes identified here (EPC1 and ARID2) infer that chromatin 
remodelling may have an important role in PDAC, along with other 
cancer types”°. 

Novel mutations in genes traditionally described for their roles in 
axon guidance were also observed by a combination of genomic data 
and supportive experimental evidence from independent murine SB 
mutagenesis screens. Axon guidance is integral to organogenesis, 
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Figure 1 | Mutations and copy number variation in axon guidance genes. 
Axon guidance pathway genes with recurrent mutations and/or copy-number 
changes defined by GISTIC2.0 analysis (Q < 0.2), and manually reviewed for 
focal alterations. a, SNV and CNV frequency per patient with gene-centric 
summary (left) and patient-centric summary (top); numbers of patients with 
mutations and proportion of each event are presented. Please see 
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Supplementary Table 4 for further details. b, Clinico-pathological variables for 
individual patients. APGI, Australian Pancreatic Cancer Genome Initiative; 
BCM, Baylor College of Medicine; IPMN, intraductal papillary mucinous 
neoplasm; Mod, moderately differentiated; OICR, Ontario Institute for Cancer 
Research; PDAC, pancreatic ductal adenocarcinoma; Undiff, undifferentiated. 
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Figure 2 | SLIT/ROBO signalling in pancreatic ductal adenocarcinoma. 

a, SLIT/ROBO signalling normally enhances -catenin complex formation 
with E-cadherin and suppresses WNT signalling activity. Loss of ROBO1/2 
signalling promotes stabilization of b-catenin, which decreases E-cadherin 
complex formation and cell adhesion and augments WNT signalling activity 
through increased nuclear translocation of b-catenin. In addition, SLIT/ROBO 
signalling can downregulate MET signalling activity; loss of ROBO signalling 
activity promotes MET signalling downstream and may have an impact on 
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Figure 3 | Axon guidance genes in human and murine pancreatic ductal 


adenocarcinoma. a, Kaplan-Meier survival curves showing co-segregation of 


aberrant expression of components of semaphorin signalling with outcome. 

Amplification at SEMA3A and PLXNA1 loci was associated with high mRNA 
expression and both are independent poor prognostic factors. b, Quantitative 
RT-PCR for components of semaphorin and SLIT/ROBO signalling in murine 
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therapeutic strategies aimed at inhibiting MET activity at the receptor level. 
(Adapted from ref. 20.) Aberrations in SLIT2 and/or ROBO1/2 affected 23% of 
patients (6% mutated with 1 patient showing mutations in both SLIT2 and 
ROBO2), with 18% demonstrating CNV corresponding to loss of the gene. 

b, c, High expression of SLIT receptor ROBO2 was associated with a better 
prognosis (b), and high expression of ROBO3, an inhibitor of ROBO2, showed 
an inverse relationship, with high levels associated with poor survival (c). HR, 
hazard ratio. 
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models of early (acinar-to-ductal metaplasia (ADM) and pancreatic injury) and 
established PDAC in genetically engineered mice with a Pdx1-promoter-driven 
activating mutation of Kras and mutant Tp53 allele (Pdx1-Cre; LSL-Kras¢”; 
LSL-Trp53™! 7H) Error bars represent standard error of the mean (see 
Supplementary Table 15 for details). 
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regeneration, wound healing and other basic cellular processes”””’. 


The widespread genomic aberrations observed here in axon guidance 
genes suggests that they may have a role in PDAC, joining mounting 
evidence in other cancers”®”*, including a recent report demonstrating 
ROBO2 mutations in liver-fluke-associated cholangiocarcinoma”. In 
addition, evidence from cancers of the lung, breast, kidney and cervix 
implicate aberrant SLIT/ROBO signalling in carcinogenesis”; Robo! 
knockout mice develop bronchial hyperplasia and focal dysplasia, and 
inactivation of Slit2 and Slit3 leads to the development of hyperplastic 
disorganized lesions in the breast*®. Upregulation of MET and WNT 
signalling have important roles in PDAC, and recent data indicate 
that SLIT/ROBO signalling modulates MET and WNT signalling 
activity through CDC42 and f-catenin, respectively”. Loss of SLIT/ 
ROBO signalling can potentially be an alternative mechanism for 
deregulating these pathways downstream of their receptors, and in 
addition could influence the activity of inhibitors that target these 
upstream components, for example, MET inhibitors (Fig. 2). 

Class 3 semaphorins are the only secreted semaphorins in ver- 
tebrates. They regulate cell growth, invasiveness and angiogenesis, 
and are highly expressed in metastatic cells in many cancer types*?*". 
Although aberrant semaphorin signalling in cancer seems to be organ 
specific’, our finding that high expression of SEMA3A and its receptor 
PLXNAI co-segregates with poor patient survival is supported by a 
previous study that reported this association and also demonstrated 
promotion of invasiveness of PDAC cell lines by SEMA3A”’. Thera- 
peutics targeting molecules involved in axon guidance have been 
developed as potential strategies to facilitate neuronal regeneration 
after injury”’, but are yet to be assessed for their role in cancer treatment. 

As illustrated here, global genomic analysis of large, well-annotated 
and clinically homogeneous cohorts of patients can identify mecha- 
nisms that are common among genomically diverse cancers, and will 
be pivotal in the development of novel therapeutic strategies that are 
guided by the determination of the molecular phenotype of individual 
patients**. Future work will be required to determine which key com- 
ponents, when damaged, drive the disease, and these mechanisms 
will need to be assessed in molecularly well-characterized preclinical 
models**. The potential therapeutic strategies identified will then 
require testing in appropriate clinical trials that are specifically designed 
to target subsets of patients stratified according to well-defined mole- 
cular markers**””. 


METHODS SUMMARY 

Sample acquisition and processing. Samples used were prospectively acquired 
and restricted to primary operable, non-pretreated pancreatic ductal adenocarci- 
noma. Representative sections were reviewed independently by at least one addi- 
tional pathologist with specific expertise in pancreatic diseases. Samples either 
had full face frozen sectioning performed in optimal cutting temperature (OCT) 
medium, or the ends excised and processed in formalin to verify the presence of 
carcinoma in the sample to be sequenced and to estimate the percentage of malig- 
nant epithelial nuclei in the sample relative to stromal nuclei. Macrodissection was 
performed if required to excise areas that did not contain malignant epithelium. 

Sequencing. Cellularity of each tumour sample was estimated with pathology 
review, deep sequencing of KRAS and a method developed using genome-wide 
SNP array data (qpure’’). Exon capture was performed using the SureSelect II or 
Nimblegen capture methods and paired-end sequenced on the SOLiD (v4) or 
GAII/HiSeq platforms. Somatic mutations were called and then verified on the 
Ion Torrent Personal Genome Machine (Life Technologies Corporation) and 454 
(Hoffman-La Roche Limited). 

Analysis. Significantly mutated genes were identified using the Genome MuSiC 
package’’. DNA copy number analyses were performed using the Illumina 
HumanOmnil Quad genotyping arrays and GenoCN software. Recurrent and sig- 
nificant copy number changes were identified using GISTIC2.0"*. Functional enrich- 
ment of gene categories was assessed using the Metacore package (Thomson-Reuters 
Corporation) and the MSigDB v3.0 database**. All sample information and data for 
mutation, copy number and expression analyses were submitted to the ICGC DCC 
at http://dcc.icgc.org/. A complete description of the materials and methods in- 
cluding approvals for human research and animal experimentation is provided in 
Supplementary Information. 
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Broad and potent neutralization of HIV-1 
by a gp41-specific human antibody 


Jinghe Huang'*, Gilad Ofek?*, Leo Laub', Mark K. Louder’, Nicole A. Doria-Rose’, Nancy S. Longo’, Hiromi Imamichi', 
Robert T. Bailer’, Bimal Chakrabarti*, Shailendra K. Sharma’, S. Munir Alam*, Tao Wang’, Yongping Yang”, Baoshan Zhang”, 
Stephen A. Migueles', Richard Wyatt*, Barton F. Haynes‘, Peter D. Kwong’, John R. Mascola? & Mark Connors! 


Characterization of human monoclonal antibodies is providing considerable insight into mechanisms of broad HIV-1 
neutralization. Here we report an HIV-1 gp41 membrane-proximal external region (MPER)-specific antibody, named 
10E8, which neutralizes ~98 % of tested viruses. An analysis of sera from 78 healthy HIV-1-infected donors demonstrated 
that 27°% contained MPER-specific antibodies and 8° contained 10E8-like specificities. In contrast to other neutralizing 
MPER antibodies, 10E8 did not bind phospholipids, was not autoreactive, and bound cell-surface envelope. The 
structure of 10E8 in complex with the complete MPER revealed a site of vulnerability comprising a narrow stretch of 
highly conserved gp41-hydrophobic residues and a critical arginine or lysine just before the transmembrane region. 
Analysis of resistant HIV-1 variants confirmed the importance of these residues for neutralization. The highly conserved 
MPER is a target of potent, non-self-reactive neutralizing antibodies, suggesting that HIV-1 vaccines should aim to 
induce antibodies to this region of HIV-1 envelope glycoprotein. 


Induction of an antibody response capable of neutralizing diverse 
HIV-1 isolates is a critical goal for vaccines that protect against HIV- 
1 infection. Potentially the greatest obstacle to achieving this goal is the 
extraordinary diversity that develops in the target of neutralizing anti- 
bodies, the envelope glycoprotein (Env). Although vaccines have thus 
far failed to induce broadly neutralizing antibody responses, there are 
examples of chronically infected patients with sera that neutralize 
highly diverse HIV-1 isolates’ *. These individuals provide evidence 
that it is possible for the human antibody response to neutralize highly 
diverse strains of HIV-1, although the mechanisms by which such 
responses are induced or mediated remain incompletely understood”””. 

Recently, isolation and characterization of human monoclonal 
antibodies from cells of chronically infected patients have provided 
considerable advances in understanding the specificities and mechan- 
isms underlying broadly neutralizing antibody responses to HIV-1. 
Env exists on the virion and infected-cell surface as a trimer of hetero- 
dimers made up of gp120 and gp41 subunits. For some time, only a 
small number of broadly neutralizing monoclonal antibodies had 
been isolated, consisting of one antibody that binds the CD4-binding 
site on gp120 (b12), one that binds a glycan configuration on the outer 
domain of gp120 (2G12), and three that bind the membrane-proximal 
external region (MPER) on gp41 (2F5, Z13el and 4E10)'*"*. More 
recently, considerably more broad and potent antibodies have been 
discovered that target the CD4-binding site of the envelope protein'*"” 
(for which VRCO1 is a prototype) and glycan-containing regions of the 
variable 1 (V1)/V2 and V3 regions of gp120*'*° (for which PG9 and 
PGT128 are prototypes). The specificities of these new antibodies are 
providing important information regarding antigen targets on Env to 
which the humoral immune response might be directed to mediate 
broad and potent neutralization. However, evidence for these specifi- 
cities in many chronically infected patients within our HIV-1-infected 
cohort’ is lacking, suggesting that broad and potent neutralization may 
be mediated by other specificities. 


Here we report isolation of a broad and potent gp41 MPER-specific 
human monoclonal antibody, 10E8, from an HIV- 1-infected individual 
with high neutralization titres. 10E8 is among the most broad and potent 
antibodies thus far described, and lacks many of the characteristics 
previously thought to limit the usefulness of MPER-specific antibodies 
in vaccines or passive therapies, including lipid binding and auto- 
reactivity. In addition, the crystal structure of 10E8, along with bio- 
chemical binding studies, demonstrate that the breadth of 10E8 is 
mediated by its unique mode of recognition of a structurally conserved 
site of vulnerability within the gp41 MPER. 


10E8 isolation and neutralizing properties 

To understand the specificities and binding characteristics that 
underlie a broadly neutralizing antibody response we developed tech- 
niques that permitted isolation of human monoclonal antibodies 
without previous knowledge of specificity*°. Serum from one donor, 
N152, exhibited neutralizing breadth and potency in the top 1% of our 
cohort against a 20 cross-clade pseudovirus panel (Supplementary 
Table 1)'. Peripheral blood CD19*IgM IgD IgA” memory B cells 
from this patient were sorted and expanded for 13 days with inter- 
leukin (IL)-2, IL-21 and CD40-ligand expressing feeder cells. The 
supernatants of ~ 16,500 B-cell cultures were screened and IgG genes 
from wells with neutralization activity were cloned and re-expressed”! 
and two novel antibodies (10E8 and 7H6) were isolated. 

Nucleotide sequence analysis of DNA encoding 10E8 and 7H6 
revealed that both were IgG3 antibodies and were somatic variants 
of the same IgG clone. These antibodies were derived from IGHV3- 
15*05 and IGLV3-19*01 germline genes, and were highly somatically 
mutated in variable genes of both heavy chain (21%) and A light chain 
(14%) compared to germ line. These antibodies also possessed a long 
heavy-chain complementarity-determining 3 region (CDR H3) loop 
composed of 22 amino acids (Fig. 1a). The heavy chains of 10E8 and 
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7H6 3-15*05 = 3-3*01 1*01 22 63/294 (21%) 
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Figure 1 | Analyses of 10E8 sequence and neutralization. a, Inferred 
germline genes encoding the variable regions of 10E8 and 7H6. Dagger footnote 
symbol indicates that germline alleles were determined using the IMGT 
database (http://imgt.org). b, Neutralizing activity of antibodies against a 181- 
isolate Env-pseudovirus panel. Dendrograms indicate the gp160 protein 
distance of HIV-1 primary isolate Env glycoproteins. Letters indicate clade. 
Data below the dendrogram show the number of tested viruses, the percentage 
of viruses neutralized and the geometric mean ICs9 for viruses neutralized with 
an ICs9 <50 ug ml‘. Median titres are based on all tested viruses, including 
those with ICs) > 50 ug ml” ', which were assigned a value of 100. 


7H6 were identical and there were only two residue differences in the 
light chain (Supplementary Fig. 1)”. 

To assess neutralization activity of the clonal variants, they were 
initially tested against 5 Env pseudoviruses (Supplementary Table 1a), 
and monoclonal antibody 10E8 was selected for further study. To 
determine whether the neutralization activity of 10E8 was representa- 
tive of the overall neutralization specificity present in patient N152 
donor serum, the neutralization panel was expanded to 20 Env pseu- 
doviruses, and 10E8 was tested in parallel with N152 donor serum. 
Although there were some similarities in the pattern of neutralization 
of highly resistant variants, a correlation of the neutralization half- 
maximum inhibitory concentration (ICs59) of monoclonal antibody 
10E8 and inhibitory dilution (IDs9) of N152 serum did not achieve 
statistical significance (P= 0.11; Supplementary Fig. 2 and Sup- 
plementary Table 1b). This finding indicates that although 10E8 
may have a major role, the full breadth of neutralization of HIV-1 
by N152 serum is probably mediated by an amalgam of 10E8-like or 
other antibodies. 

To compare the neutralization potency and breadth of 10E8 with 
other broadly neutralizing anti-HIV-1 antibodies, 10E8 was then 
tested in a 181-isolate Env-pseudovirus panel in parallel with 4E10, 
2F5, VRCO1, NIH45-46, 3BNC117, PG9 and PG16 (Fig. 1b and 
Supplementary Table 1c). At an ICs9 below 50 pg ml ', 10E8 neutra- 
lized 98% of the tested pseudoviruses compared to 98% for 4E10 and 
89% for VRCO1. However, at an ICs below 1 pg ml’, 10E8 neutra- 
lized 72% of the tested viruses compared to 37% for 4E10. The median 
and geometric mean ICs9 values for 10E8 were below 1 pg ml. Thus, 
10E8 mediates broad and potent neutralization against a large range 
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of viruses and the potency is comparable to some of the best available 
monoclonal antibodies. 


10E8 epitope specificity and binding 

To map the epitope of the 10E8 antibody, we tested binding to different 
subregions of Env by enzyme-linked immunosorbent assay (ELISA). 
10E8 bound strongly to gp140, gp41 and the 4E10-specific MPER 
peptide, but not to gp120 (Fig. 2a). To map further the 10E8 epitope 
within the MPER, we examined binding of 10E8 to overlapping pep- 
tides corresponding to the 2F5 (656-671), Z13e1 (666-677) and 4E10 
(671-683) specificities. 1OE8 bound to the full MPER and the 4E10- 
specific peptides, but not 2F5- or Z13e1-specific peptides. Within the 
4E10 epitope, when a peptide with a truncated carboxy terminus was 
tested, 4E10.19 (671-680), 10E8 binding was weakened considerably, 
indicating that the three terminal amino acids of the MPER—Tyr681, 
Tle682 and Arg683—were crucial for 10E8 binding (Supplementary 
Fig. 3a). Consistent with these results, only the full MPER and 4E10- 
specific peptides blocked 10E8-mediated neutralization of the chimaeric 
Cl virus, which contains the HIV-2 Env with the HIV-1 MPER (Sup- 
plementary Fig. 3b). Taken together, these data indicate that the minimal 
10E8 epitope is located within residues 671-683 of the MPER, although 
additional contacts towards the amino terminus of the MPER could not 
be excluded. 

To map more precisely the epitope of 10E8, a panel of alanine 
mutant peptides scanning MPER residues 671-683 was used to block 
10E8 neutralization of the chimaeric Cl virus (Fig. 2b)”. MPER pep- 
tides with alanine substitutions at Trp 672, Phe 673 or Thr 676 failed 
to block 4E10 or 10E8 neutralization, indicating that these residues 
are critical for both 4E10 and 10E8 binding. Residues Asn 671 and 
Arg 683, both of which are not required for 4E10 binding, were found 
to be critical for 10E8 binding and neutralization (Supplementary 
Table 2 and Fig. 2b). We also tested the ability of 10E8 to neutralize 
HIV-1 JR2 pseudoviruses with alanine substitutions in MPER residues 
660-683 (Supplementary Table 3). Consistent with the effects ofalanine 
substitutions on peptide binding, residues Asn 671 and Arg 683 were 
critical for 10E8, but not 4E10, neutralization. Individual alanine sub- 
stitutions at residues 671-673, 680 and 683 resulted in reduced neu- 
tralization sensitivity to 10E8 most apparent at the ICg9 level rather 
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Figure 2 | Binding specificity of 1OE8. a, ELISA binding of monoclonal 


antibody (mAb) 10E8 or 4E10 to gp140, gp120, gp41 or 4E10 peptide. Error 
bars denote one standard error of the mean (s.e.m.). b, Inhibition of 
monoclonal antibody 10E8 or 4E10 neutralization of C1 HIV-2/HIV-1 MPER 
virus by 4E10 alanine scanning peptides. Residues shown in red indicate 
positions for which the alanine mutant peptide did not block neutralization. 
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than the ICso level. Although the mechanism for this phenomenon is 
unclear, a similar effect has been observed previously when MPER 
mutations cause partial resistance to 4E10**. Taken together, these 
results indicated that 10E8 recognized a novel epitope that overlaps 
the known 4E10 and Z13e1 epitopes but differs in a critical dependence 
on binding to Asn 671 and Arg 683, the last residue of the MPER. 
We next investigated whether the greater neutralization potency of 
10E8 compared to other MPER antibodies was a result of higher 
binding affinity to the MPER. Capture of a biotinylated peptide 
encompassing the full MPER (656-683) to a surface-plasmon res- 
onance (SPR) chip allowed the binding kinetics of antigen-binding 
fragments (Fabs) 10E8, 2F5 and 4E10 to be examined. In contrast to 
its higher neutralization potency, the dissociation constant (Kg) of 
10E8 to this MPER peptide was weaker than that of 2F5 and 4E10: 
17nM for 10E8 versus 3.8nM for 2F5 and 0.74nM for 4E10 
(Supplementary Fig. 4). Therefore, the affinity of 10E8 for the 
MPER in a soluble peptide format did not explain its greater neutral- 
ization potency compared to other MPER-specific antibodies. 


Prevalence of 10E8-like antibodies 


We next investigated the prevalence of MPER-specific and 10E8-like 
neutralizing antibodies in our cohort of HIV-1-infected donors. We 
selected 78 sera from our cohort with a neutralization ID59 > 100 
against at least one pseudovirus in a five-virus mini-panel’. The median 
time since diagnosis of these donors was 13.5 years, median CD4 count 
was 557 cells wl, median plasma HIV RNA 5,573 copies ml_1, and 
they were not receiving antiretrovirals. We tested neutralization against 
the HIV-2/HIV-1 chimaera C1 (Supplementary Table 4)*°. Of 78 sera, 
21 exhibited neutralization activity against the HIV-2/HIV-1 Cl virus 
(Supplementary Table 5). To map the region that was targeted by these 
sera, neutralization was measured using seven HIV-2/HIV-1 chimaeras 
containing subdomains of the MPER (Supplementary Table 4)*°. Of the 
21 sera with neutralization activity against the entire MPER, 8 exhibited 
a neutralization pattern similar to that observed for 10E8, which 
entailed neutralization of only those HIV-2/HIV-1 chimaeric viruses 
that contained the terminal residue of the MPER Arg 683 (C4, C4GW 
and C8; Supplementary Table 5). For further confirmation of these 
results, we used peptides corresponding to different portions of the 
MPER to block sera neutralization of the HIV-2/HIV-1 chimaera C1 
(Supplementary Table 6). Of the eight sera found to have a 10E8-like 
pattern based on neutralization of the chimaeras, three were blocked by 
peptides consistent with 10E8-like activity. An additional three of the 
eight 10E8-like sera were blocked by peptides in a pattern consistent 
with a combination of 10E8- and Z13e1-like antibodies. The six patients 
whose sera had 10E8-like activity did not differ from the remaining 72 
patients with regard to clinical course or HIV neutralization (Sup- 
plementary Fig. 5, legend). Overall, 27% of the tested patient sera 
exhibited anti-MPER neutralizing activity. This prevalence is con- 
siderably higher than observed in previous work, possibly related to 
selection of donors with known neutralizing activity*?°**. Further- 
more, 8% of the tested sera had 10E8-like antibodies (Supplementary 
Fig. 5), indicating that 10E8-like antibodies are not rare. 


Analysis of 10E8 autoreactivity 


A property common to the previously characterized MPER monoclonal 
antibodies 2F5 and 4E10 is that they crossreact with self-antigens”. In 
addition, binding to both the cell membrane and the Env trimer is 
thought to be important for optimal neutralization by these antibodies 
and this autoreactivity may be an obstacle to the elicitation of similar 
antibodies by a vaccine”. Surface plasmon resonance analysis showed 
that 10E8 did not bind to anionic phospholipids, such as phosphatidyl 
choline-cardiolipin (PC-CLP) and phosphatidyl choline-phosphatidyl 
serine (PC-PS) liposomes (Fig. 3a). 10E8 also did not bind HEp-2 
epithelial cells, in contrast to 2F5 and 4E10 that bound in a cytoplasmic 
and nuclear pattern (Fig. 3b). Additionally, 10E8 did not bind auto- 
antigens, such as Sjogren’s syndrome antigens A and B, Smith antigen, 
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Figure 3 | Analysis of 10E8 autoreactivity. a, SPR analysis of 10E8 binding to 
anionic phospholipids. 10E8 was injected over PC-CLP liposomes or PC-PS 
liposome immobilized on the BIAcore L1 sensor chip. 4E10 and 2F5 were used 
as positive controls and 13H11, 17b and anti-RSV F protein as negative 
controls. b, Reactivity of 10E8 with HEp-2 epithelial cells. Controls are as above 
with VRCO1 added as an additional negative control. Antibody concentration 
was 25 tg ml". All pictures are shown at X400 magnification. 


ribonucleoprotein, scleroderma 70 antigen, Jol antigen, centromere B 
and histone (Supplementary Table 7). Taken together, these results sug- 
gest that 10E8, in contrast to other MPER antibodies, is not autoreactive. 


Virion accessibility of 10E8 


The 2F5 and 4E10 antibodies have been shown to bind relatively 
poorly to the HIV-1 envelope spike on the surface of infected cells 
or to cell-free virions, and react more efficiently after Env engagement 
of the CD4 receptor*’. We measured binding to cleaved, full-length 
envelope spikes on HIVjrpry transfected cells (Supplementary Fig. 6a). 
Although 10E8 bound less efficiently than other antibodies such as 
VRCO1 or F105, where accessibility is not an issue, it bound more 
efficiently than either 2F5 or 4E10. In contrast to results of alanine 
substitution, a mutation in the 4E10 (F673S) region in full-length 
HIVjrpz envelope spikes enhanced 10E8 binding although the mecha- 
nism remains unclear. A mutation in the 2F5 (K665E) region had no 
influence on 10E8 binding. These data indicate that 10E8 has modestly 
greater access to the MPER epitope on the cell surface than either 2F5 
or 4E10. 

Toassess binding to cell-free virus, we incubated virions with antibody, 
washed out unbound antibody, and tested neutralization*’*’. During 
washing, antibodies that cannot access their Env target on free virions 
will be largely removed and therefore neutralization will be diminished. 
Asa control, neutralization of the HXBc2 isolate was not diminished by 
washing, because the MPER region is accessible on this laboratory- 
adapted isolate*'. Washing also had little impact on neutralization 
of JRFL by VRCO1. Consistent with previous work, 2F5 and 4E10 
neutralization of most virus isolates tested was substantially dimin- 
ished after washing (Supplementary Fig. 6b)*"”*. In contrast to 2F5 and 
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4E10, washing had a smaller effect on 10E8 neutralization of most 
viruses tested, as measured by the area under the curve or analysis of 
the fold-change in neutralization at a fixed inhibitory concentration 
(Supplementary Fig. 6c). Although 10E8 is not fully able to access its 
epitope on the native viral spike similarly to VRCO1, under most 
experimental conditions tested it was better able to access its epitope 
than either 2F5 or 4E10. 


Structure of the 10E8-gp41 complex 


To provide an atomic-level understanding of the interaction of 10E8 
with HIV-1, we crystallized the Fab of 10E8 in complex with a peptide 
encompassing the entire 28-residue gp41 MPER (residues 656-683). 
Monoclinic crystals diffracted to 2.1 A resolution, and structure solu- 
tion and refinement to Raye = 18.01% (Réree = 21.76%) revealed two 
complexes in the asymmetric unit (heretofore referred to as com- 
plexes 1 and 2) (Supplementary Table 8). Overall, 10E8 bound 
to one face of the MPER peptide, which formed two helices, each 
15-20 A in length and oriented 100° relative to each other (Fig. 4a). 
Electron density was observed for the entire MPER, ranging from 
Asn 656 to Arg 683 (Leu 660 to Arg 683 for complex 2), with the high- 
est degree of ordered density observed from residue Trp 666 within 
the N-terminal helix through to Arg 683 of the C-terminal helix 
(Supplementary Fig. 7). Analysis of main-chain dihedral angles 
(Supplementary Table 9) indicated that the N-terminal «-helix 
extends from residue Asn 657 to Ala 667, tightens into a 3) 9-helix 


Figure 4 | Crystal structure of 10E8 Fab in complex with its gp41 MPER 
epitope. a, 10E8 recognizes a highly conserved gp41 helix to neutralize HIV-1. 
Fab 10E8 is shown in ribbon representation (shades of violet for heavy chain 
(HC) and of grey for light chain (LC)) in complex with a gp41 peptide (red) that 
encompasses the MPER (Asn 656-Arg 683). b, Interface between 10E8 and 
gp41 with select 10E8 side chains (green, heavy chain; cyan, light chain) and 
gp41 side chains (grey) in stick representation. In analogy to a hand, the hinge 
can be viewed as being gripped by a thumb (represented by the CDR H2), the 
C-terminal helix as being suspended along a corresponding extended forefinger 
(represented by the CDR H3), and residues that commence the C-terminal 
helix as being caught in the cleft between the thumb and forefinger (represented 
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between residues Ser 668 and Leu 669, before turning at residues 
Trp 670 and Asn671. The C-terminal o-helix, capped by Asn 671, 
starts at residue Trp 672 and extends to residue Arg 683, the final 
residue of the MPER (Fig. 4a, b). 

The 10E8 antibody contacts the gp41 MPER primarily through its 
heavy chain, although crucial contacts are also mediated by the light 
chain CDR L3 (Fig. 4a-c and Supplementary Tables 10-12). Three 
predominant loci of interaction are observed between the antibody 
and gp41 (Supplementary Tables 13 and 14): one between residues of 
the tip of the CDR H3 loop and the tip of the C-terminal helix of the 
peptide, a second between residues of the CDR H2 loop and residues 
of the hinge region of the peptide, and a third at the juncture of the 
three heavy-chain CDR loops and the light chain CDR L3, which form 
a hydrophobic cleft that holds residues of the beginning of the MPER 
C-terminal helix (Fig. 4b). 


10E8-gp41 interface 


To complement the results observed for the mutagenesis of the 
highly conserved 10E8 epitope (Fig. 4d, f, h and Supplementary 
Tables 2 and 3), each residue of the 10E8 paratope, as determined 
from the crystal structure, was individually mutated to alanine and 
the resulting 25 10E8 variants assessed for affinity to a soluble 
MPER peptide. Overall, the most pronounced effects of the alanine 
mutations on the binding affinity of 10E8 to a soluble MPER peptide 
occurred within residues of the CDR H3 loop, although mutations 
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by the juncture of the CDR loops). (See ref. 22 for numbering.) c, d, Buried 
contact surfaces and epitope conservation. An examination of the buried 
contact surface on gp41 (grey; c) reveals that highly conserved epitope residues 
(labelled, d) are buried by 10E8 (Supplementary Tables 11 and 12; conservation 
percentages provided in parentheses). e-h, Alanine mutagenesis of paratope 
and epitope. Residues at the tip of the 10E8 CDR H3 loop and within the 
hydrophobic cleft are crucial for recognition of gp41 and for virus 
neutralization (Supplementary Tables 15 and 16), as mapped onto the buried 
10E8 contact surface (c, e, g). These results mirror the effects of alanine scan 
mutations of the 10E8 epitope (Supplementary Tables 2 and 3), as mapped onto 
the buried gp41 contact surface (f, h). 
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within the hydrophobic cleft also showed substantial effects (Fig. 4e, 
Supplementary Table 15 and Supplementary Fig. 8). 10E8 residues 
identified by an alanine scan as critical for the interaction with gp41 
stretched from the cleft all the way to the tip of the CDR H3 (Fig. 4e) 
and were mirrored by a corresponding stretch of gp41 residues that 
substantially affected 10E8 binding when mutated to alanine 
(Fig. 4f). 

The same panel of 10E8 alanine mutations was tested for neutrali- 
zation potency against a panel of five Env pseudoviruses that included 
both tier 1 and tier 2 viruses (Supplementary Table 16). Similar to the 
binding data, residues of the 1OE8 CDR H3 had marked effects on 
neutralization, as did residues of the hydropohobic cleft (Fig. 4g). 
Generally, Kg values of paratope mutants correlated with neutraliza- 
tion (Supplementary Fig. 9). Backbone interactions (on both 10E8 
and gp41) also contribute to the interface, especially between the 
CDR H2 of 10E8 and the hinge region of the MPER, although these 
are silent in alanine scan analyses. Overall, 10E8 uses a narrow band of 
residues (~20 X 5A) that stretches from the CDR H1 and H2 and 
extends along most of the CDR H3 to recognize a string of highly 
conserved hydrophobic gp41 residues, along with a critical charged 
residue (Arg/Lys 683), that occurs just before the transmembrane 
region (Fig. 4f, h). 


A conserved gp41-neutralization determinant 


Several structures of neutralizing antibodies in complex with the 
MPER of gp41 have been reported previously, including those for 
antibodies 2F5, Z13el and 4E10°°” (Supplementary Fig. 10a). The 
MPER adopts divergent loop conformations when bound by 2F5 and 
Z13e1 and an o-helix when bound by 4E10. Comparison of 2F5, 
Z13e1 and 4E10 epitopes with 10E8-bound gp41 revealed that only 
the 4E10 epitope has similar secondary structure, with superposition 
yielding a root mean squared deviation (r.m.s.d.) of 2.49 A for all 
atoms of residues 671-683 and 0.98 A for main-chain atoms (Sup- 
plementary Fig. 10b and Supplementary Table 17). 

To further compare the recognition of gp41 by 10E8 and 4E10, we 
examined their angles of epitope approach. As shown in Supplementary 
Fig. 10c-f, alignment of the recognized MPER helix places 10E8 and 
4E10 into similar overall spatial positions. The relative orientations of 
the recognized helix and the heavy and light chains of the two antibodies, 
however, differ markedly. With 10E8, the C-terminal helix is perpen- 
dicular to the plane bisecting the heavy and light chains (Supplementary 
Fig. 10c, e); with 4E10, the recognized helix is at the interface between the 
heavy and light chains (Supplementary Fig. 10d, f). Perhaps relevant to 
this, 10E8 uses CDR loops almost exclusively in its recognition of 
gp41, whereas 4E10 incorporates substantial $-strand interactions with 
gp4l. 

The differing modes of 10E8 and 4E10 recognition of the conserved 
C-terminal MPER helix result in a substantial difference in the pro- 
portion of the recognized helical face: 10E8 contacts roughly one-third 
of the helical face, whereas 4E10 contacts over half (Supplementary Fig. 
10g, h and Supplementary Tables 18 and 19). The smaller contact 
surface of 10E8 may provide an explanation for the reduced recog- 
nition of lipid surfaces by 10E8 versus 4E10, providing a potential 
structure-based explanation for reduced autoreactivity of 10E8. 


Sequence variation and 10E8 neutralization 

To place the specificity and structural data into the context of known 
variation of the MPER, we analysed viral sequences with resistance to 
neutralization by 10E8 (Fig. 5a). Of the 181 viruses tested, only 3 were 
highly resistant to 10E8 with ICso > 50 xg ml‘. Each of these viruses 
had substitutions at positions found to affect neutralization by alanine 
scanning (Asn 671, Trp 672, Phe 673 and Trp 680). Plasma virus from 
patient N152, from whom 10E8 was cloned, is also probably resistant 
to 10E8-mediated neutralization*®. Sequence analysis of plasma viral 
RNA revealed rare substitutions at positions Trp 680 and Lys/Arg 683 
(Fig. 5a). These residues are typically highly conserved with variation 
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Figure 5 | A site of gp41 vulnerability. a, Impact of sequence variation on 
10E8 neutralization. Predicted amino acid sequences within the binding 
epitope of 10E8 for three 10E8-resistant viruses and the patient virus are shown. 
The 10E8 epitope and differences in sequence compared to the JR2 virus are 
labelled in red. [C59 and ICgg values that are >20-fold compared with JR2 wild- 
type pseudovirus are highlighted in yellow. Error bars denote one s.e.m. 

b, Structural definition of a highly conserved region of gp41 recognized by 
neutralizing antibodies. Atoms of highly conserved residues that make direct 
contacts with 10E8 (crystal complex 1) are coloured red and shown in stick 
representation. Remaining atoms buried by 10E8 are coloured purple, and 
those making main-chain contacts are coloured cyan. Semi-transparent 
surfaces of the gp41 MPER are coloured according to the underlying atoms. 90° 
views are shown, with bound antibody 10E8 in the right panel. The 10E8 CDR 
H3 interacts with highly conserved hydrophobic residues, whereas the CDR H2 
contacts main-chain atoms at the juncture between the N- and C-terminal 
helices. Many of the unbound residues of the MPER (grey) are hydrophobic, 
especially those within the C-terminal helix. In the structure of the late fusion 
intermediate (Supplementary Fig. 11), gp41 residues that contact 10E8 largely 
face the outside of the helical coiled-coil. 


only occurring in 1.17% of 3,730 HIV Env sequences in the Los Alamos 
Database (http://www.hiv.lanl.gov). When the substitutions for the 
three resistant viruses and the patient viruses were placed on the back- 
ground of the sensitive JR2 virus, substitutions at Asn671Thr, 
Trp672Leu and Phe673Leu had a modest effect on the ICso values 
but raised the ICgo values above 20 jig ml’. In the structural analysis, 
direct contacts with 10E8 were not observed at position 671, indicating 
that the effects on neutralization of Thr or Ala substitutions at this 
position are mediated by conformational or other effects within gp41. 
The combination of Trp672Leu and Phe673Leu conferred high-level 
resistance at the IC;) and ICgp level. Changes corresponding to the 
patient’s dominant circulating virus had a similar effect. Although Lys/ 
Arg683Gln alone conferred resistance at the ICgo level, together 
Trp680Arg and Lys/Arg683Gln resulted in greater resistance to 10E8 
(Fig. 5a). When taken together with the analysis of the 10E8 paratope, 
these data suggest that in addition to Trp 672, Phe 673 and Trp 680 
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found in the 4E10 epitope, the additional 10E8-bound residue Lys/ 
Arg683 is critical to neutralization. In addition to other differences 
in binding based on structural analyses noted above, it is possible that 
the additional potency of 10E8 compared to 4E10 against naturally 
occurring viral variants may be mediated through binding of highly 
conserved residues Trp 680 and Lys/Arg 683 that directly interact with 
the 10E8 CDRH3. 


Discussion 


10E8 is a broad and potent neutralizing antibody with important 
implications for efforts to induce such antibodies with vaccines. 
Previous MPER antibodies were limited in potency, and had a more 
limited ability to access MPER on Env of primary isolates. In addition, 
lipid binding and autoreactivity were thought to be characteristics 
of MPER antibodies and important obstacles to their elicitation by 
vaccines””””, However, 10E8 lacks each of these characteristics. In 
addition, antibodies with a similar specificity were not rare in our 
chronically infected cohort. This suggests that 10E8-like antibodies 
were not deleted from the repertoire because of autoreactivity. These 
results further suggest that 10E8-like antibodies might be raised in a 
larger fraction of HIV-uninfected persons receiving a vaccine 
designed to elicit these antibodies without the B-cell defects of chronic 
HIV infection. Design of such a vaccine will probably require not 
only presentation of an intact 10E8 epitope but also use of a 
platform sufficiently immunogenic to drive the evolution of 10E8-like 
antibodies. 

The extraordinary breadth and potency of 10E8 seems to be 
mediated by its ability to bind highly conserved residues within 
MPER. Although the epitope of 10E8 overlaps those of known 
monoclonal antibodies such as 4E10, it differs in recognition surface, 
angle of approach, lipid binding and self-reactivity. Alanine scan- 
ning, structural analysis and paratope analysis each indicate that 
10E8 makes crucial contacts with highly conserved residues 
Trp 672, Phe673, Trp676 and Lys/Arg683. The extraordinary 
breadth of some potent monoclonal antibodies, for example that 
bind the CD4 binding site, is thought to be conferred by blocking 
a functionally important site that is critical for viral entry. Whether 
10E8 impairs Env function or simply acts by binding highly 
conserved residues remains to be determined. Nonetheless, the 
breadth and potency of 10E8 demonstrates a conserved site of 
gp41 vulnerability (Fig. 5b) that is an important target antigen for 
HIV neutralization and that will probably reinvigorate interest in 
MPER-based HIV vaccine design. 


METHODS SUMMARY 


Peripheral blood CD19" IgM IgD IgA” B cells were sorted by flow cytometry, 
plated at 4 cells per well, and expanded with cytokines and feeder cells. B-cell 
culture supernatants were screened by microneutralization against HIV 03 and 
HIVgai26 pseudoviruses. IgG genes from wells with neutralization activity were 
cloned and re-expressed in 293T cells. Breadth of neutralizing activity was con- 
firmed against a 181-isolate Env pseudovirus panel. Specificity was determined by 
alanine scanning peptides and mutant Env pseudoviruses. Lipid binding and 
autoreactivity of 10E8 were measured by SPR plasmon resonance, indirect immu- 
nofluorescence on HEp-2 cells and bead arrays. Binding of HIV envelopes on 
transfected 293 cells was detected by flow cytometry. After pre-incubation with 
antibody, the impact of washing virions before infecting TZM-bl cells was used to 
measure access to viral MPER. The frequency of HIV-1~ sera with a given 
specificity was measured by the ability to neutralize HIV-2/HIV-1 chimaeras 
containing portions of the MPER. 10E8 was co-crystallized with a peptide encom- 
passing the entire 28-residue gp41 MPER (residues 656-683). Structure deter- 
mination revealed two complexes in the crystal asymmetric unit. Analysis of 
differences between the two complexes enabled essential interactions to be dis- 
cerned. The paratope, as defined by residues in the antibody that showed reduced 
solvent accessibility when in complex with gp41, was subjected to comprehensive 
alanine scan, with each of the 25 10E8 alanine mutants assessed by SPR for 
recognition of gp41 and for neutralization of a panel of 5 pseudotyped viruses. 
The sequence of the patient plasma viral RNA was derived using limiting dilution 
RT-PCR. 
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Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Study patients. We selected the plasma and peripheral blood mononuclear cells 
(PBMCs) from the HIV-1-infected patients enrolled in the National Institute of 
Health under a clinical protocol approved by the Investigational Review Board in 
the National Institute of Allergy and Infectious Diseases (NIAID-IRB). All parti- 
cipants signed informed consent approved by the NIAID-IRB. The criteria for 
enrolment were as follows: having a detectable viral load, a stable CD4 T-cell 
count above 400 cells ul’, being diagnosed with HIV infection for at least 4 years, 
and off antiretroviral treatment for at least 5 years. On the basis of the locations of 
current and former residences, all patients were presumed to be infected with 
clade B virus. Donor N152 was selected for B-cell sorting and antibody generation 
because his serum neutralizing activity is among the most potent and broad in our 
cohort. He is a slow progressor based on criteria described previously*'. At the 
time of leukapheresis, he had been infected with HIV-1 for 20 years, with CD4 
T-cell counts of 325 cells ule; plasma HIV-1 RNA values of 3,811 copies ml! 
and was not on antiretroviral treatment. 

Viruses and plasmids. HIV-1 JR2 MPER alanine mutant pseudovirus plasmids 
were obtained from M. Zwick (The Scripps Research Institute). HIV-2/HIV-1 
chimaeras were provided by G. Shaw and L. Morris. 

Memory B-cell staining, sorting and antibody cloning. Staining and single-cell 
sorting of memory B cells were performed as follows. PBMCs from HIV-1-infected 
donor N152 were stained with antibody cocktail consisting of anti-CD19-PE-Cy7 
(BD Bioscience), IgA-APC (Jackson ImmunoResearch Laboratories Inc.), IgD- 
FITC (BD Pharmingen) and IgM-PE (Jackson ImmunoResearch Laboratories 
Inc.) at 4°C in dark for 30 min. The cells were then washed with 10 ml PBS- 
BSA buffer and re-suspended in 500u1 PBS-BSA. A total of 66,000 
CD19*IgA IgD IgM” memory B cells were sorted using a FACSAria III cell 
sorter (Becton Dickinson) and re-suspended in IMDM medium with 10% FBS 
containing 100 U ml ' IL-2, 50ngml ? IL-21 and 1 X 10° ml’ irradiated 3T3- 
msCD40L feeder cells*”. B cells were seeded into 384-well microtitre plates at a 
density of 4 cells per well in a final volume of 50 pl. After 13 days of incubation, 
40 ul of culture supernatants from each well were collected and screened for 
neutralization activity using a high-throughput micro-neutralization assay against 
HIV-1yn.3 and HIV-1ga126. B cells in each well were lysed with 20 il lysis buffer 
containing 0.25 ul of RNase inhibitor (New England Biolabs Inc.), 0.3 pl of 1M 
Tris pH 8 (Quality Biological Inc.) and 19.45 pl DEPC-treated H,O. The plates 
with B cells were stored at —80 °C. 

The variable region of the heavy chain and the light chain of the immuno- 
globulin gene were amplified by RT-PCR from the wells that scored positive in 
both the HIV-1yn.3 and HIV-1 ,).26 neutralization assay. The cDNA product was 
used as template in the PCR reaction. To amplify the highly somatically mutated 
immunoglobulin gene, two sets of primers as described previously~' were used in 
two independent PCRs. One set of primers consisted of the forward primers 
and the reverse primers specific for the leader region and constant region of 
IgH, Igk or Igd, respectively. The other set of primers consisted of the forward 
primer mixes specific for FWRI and respective reverse primers specific for 
the IgH, Igk and Iga J genes. All PCRs were performed in 96-well PCR plates 
in a total volume of 50 ul containing 20 nM each primer or primer mix, 10 nM 
each dNTP (Invitrogen), 10 pl 5X Q-solution (Qiagen) and 1.2 U HotStar Taq 
DNA polymerase (Qiagen). From the positive PCR reactions, pools of the VH or 
V-region DNA were ligated to a pCR2.1-Topo-TA vector (Invitrogen) for 
sequencing before cloning into the corresponding Igy1, Igk and Igi expression 
vector. 10 ug of heavy and light chain plasmids, cloned from the same well and 
combined in all possible heavy and light chain pairs, were mixed with 40 pl 
FuGENE 6 (Roche) in 1,500 pl DMEM (Gibco) and co-transfected into 293T cells. 
The full-length IgG was purified using a recombinant protein-A column (GE 
Healthcare). 

Neutralization assays. Neutralization of the monoclonal antibodies was measured 
using single-round HIV-1 Env-pseudovirus infection of TZM-bl cells. HIV-1 Env 
pseudoviruses were generated by co-transfection of 293T cells with pSG3 delta Env 
backbone and a second plasmid that expressed HIV-1 Env. At 72 h after transfec- 
tion, supernatants containing pseudovirus were harvested and frozen at —80°C 
until further use. In the neutralization assay, 10 jl of fivefold serially diluted patient 
serum or monoclonal antibody was incubated with 40 il of pseudovirus in a 96-well 
plate at 37 °C for 30 min before addition of TZM-bl cells. After 2 days of incubation, 
cells were lysed and the viral infectivity was quantified by measuring luciferase 
activity with a Victor Light luminometer (Perkin Elmer). The 50% inhibitory con- 
centration (ICs) was calculated as the antibody concentration that reduced infec- 
tion by 50%. Antibody epitopes were mapped using HIV-1 JR2 MPER alanine 
mutant pseudoviruses in a TZM-bl assay. 

HIV-2/HIV-1 chimaera neutralization. HIV-2/HIV-1 Cl chimaera (HIV-2 
virus 7312A with HIV-1 gp41 MPER)”° was used in the competition assay. A 
fixed concentration of MPER peptide was incubated with serially diluted 2F5, 
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4E10, Z13e1 or 10E8 antibody at 37 °C for 30 min before incubation with HIV-2/ 
HIV-1 Cl chimaera. Wild-type HIV-2 virus 7312A was used as a control. 
Antibody epitope mapping was completed by adding 10 pl 10E8 monoclonal 
antibody to 5 l serial dilutions of 4E10 peptide or its alanine mutants at 37 °C 
for 30 min before the addition of HIV-2/HIV-1 C1 chimaera. The degree to which 
peptides blocked antibody-mediated neutralization was calculated as the fold 
change in the ICs value of the antibody in the presence of 4E10 alanine mutants 
compared to the wild-type peptide. The precise binding region within the MPER 
targeted by patient serum or antibodies was determined using the HIV-2/HIV-1 
chimaeras containing different portions of HIV-1 MPER, such as C1 (HIV-2 Env 
with HIV-1 MPER), C1C (HIV-2 Env with clade C MPER), C3 (HIV-2 Env with 
2F5 epitope), C4 (HIV-2 Env with 4E10 epitope), C6 (HIV-2 Env with short 4E10 
epitope NWEDIT), C7 (HIV-2 Env with short 2F5 epitope ALDKWA) and C8 
(HIV-2 Env with both Z13 and 4E10 epitope). Fivefold diluted patient serum or 
monoclonal antibody was incubated with chimaera in a 96-well plate at 37 °C for 
30 min before addition of TZM-bl cells. The specificities within patient sera were 
confirmed by blocking neutralization of the C1 chimaera with 25 pg ml! of 2F5, 
4E10, MPER, Bal.V3, control peptide, or 50 1g ml! of Z13 peptide. 

ELISA assays. Each antigen at 2 1g ml’ was coated on 96-well plates overnight 
at 4 °C. Plates were blocked with BLOTTO buffer (PBS, 1% FBS, 5% non-fat milk) 
for 1 hat room temperature, followed by incubation with antibody serially diluted 
in disruption buffer (PBS, 5% FBS, 2% BSA, 1% Tween-20) for 1h at room 
temperature. 1:10,000 dilution of horseradish peroxidase (HRP)-conjugated goat 
anti-human IgG antibody was added for 1 h at room temperature. Plates were 
washed between each step with 0.2% Tween 20 in PBS. Plates were developed 
using 3,3’,5,5'-tetramethylbenzidine (TMB) (Sigma) and read at 450 nm. 
Autoreactivity assays. Binding of 10E8 to phospholipid was measured by SPR 
conducted on a BIAcore 3000 instrument and data analyses were performed using 
the BIAevaluation 4.1 software (BIAcore) as described previously*®. Phospholipid- 
containing liposomes were captured on a BlAcore L1 sensor chip, which uses an 
alkyl linker for anchoring lipids. Before capturing lipids, the surface of the L1 chip 
was cleaned with a 60-s injection of 40 mM octyl-B-p-glucopyranoside, at 100 pl 
min ', and the chip and fluidics were washed with excess buffer to remove any 
traces of detergent. Monoclonal antibodies were then injected at 100 .gml | at a 
flow rate of 30ulmin™’. After each antibody injection, the surface was again 
cleaned with octyl-B-p-glucopyranoside, and 5-s injections of each 5mM HCl, 
then 5mM NaOH, to clean any adherent protein from the chip. 

Reactivity to HIV-1 negative human epithelial (HEp-2) cells was determined 
by indirect immunofluorescence on slides using Evans Blue as a counterstain and 
FITC-conjugated goat anti-human IgG (Zeus Scientific)”. Slides were photo- 
graphed on a Nikon Optiphot fluorescence microscope. Regarding Fig. 3b, koda- 
chrome slides were taken of each monoclonal antibody binding to HEp-2 cells at a 
10-s exposure, and the slides scanned into digital format. The Luminex AtheNA 
Multi-Lyte ANA test (Wampole Laboratories) was used to test for monoclonal 
antibody reactivity to SSA/Ro, SS-B/La, Sm, ribonucleoprotein (RNP), Jo-1, double- 
stranded DNA, centromere B, and histone and was performed as per the manu- 
facturer’s specifications and as previously described”. Monoclonal antibody 
concentrations assayed were 50, 25, 12.5 and 6.25 wg ml !. 10 pil of each concen- 
tration were incubated with the luminex fluorescent beads and the test performed 
per the manufacturer’s specifications. 

Fluorescence-activated cell sorting (FACS) staining of cell-surface HIV-1 Env. 
FACS staining was performed as previously described*!“*. Forty-eight hours after 
transfection, cells were collected and washed in FACS buffer (PBS, 5% HIFBS, 
0.02% azide) and stained with monoclonal antibodies. The transfected cells were 
suspended in FACS buffer and were incubated with the antibodies for 1 h at room 
temperature. The monoclonal antibody-cell mixture was washed extensively in 
FACS buffer and phycoerythrin (PE)-conjugated goat anti-human secondary 
antibody (Sigma) was added for 1h at a 1:200 dilution, followed by extensive 
washing to remove unbound secondary antibody. The antibody-PE-stained cells 
were acquired on a BD LSRII instrument and analysed by FlowJo. 

Antibody-virus washout experiments. From a starting concentration of 2 mg 
ml ', 12.5 pl of fivefold serially diluted antibodies in PBS were added to 487.5 pl 
of DMEM containing 10% heat-inactivated FBS and 15 ul of pseudovirus such 
that the final concentrations of antibodies were 50 ug ml! to 0.08 pg ml? in a 
total volume of 500 pl. In the ‘no inhibitor’ control, the same volume of PBS was 
added instead of antibody. The reaction mixture was incubated for 30 min at 
37°C. The 250 ll reaction mixture was diluted to 10 ml with complete DMEM, 
centrifuged at 25,000 r.p.m. in a SW41 rotor, for 2h at 4 °C. The virus pellet was 
then washed two additional times with 10 ml of PBS. During the washing steps, 
the virus—antibody complex was centrifuged at 40,000 r.p.m. for 20 min at 4°C. 
After the final wash, 250 pl of DMEM was added to the washed virus pellet and it 
was re-suspended by gentle shaking at 4°C for 30 min. A total of 100 pl of the 
suspended virus was used to infect 100 il of TZM-bl cells (0.2 million per ml), in 
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duplicate. From the remaining 250 ul of reaction mixture, an equal volume of the 
antibody virus mixture was used as a ‘no washout’ control. Plates were incubated 
at 37 °C in a CO, incubator for 2 days. After 2 days, the luciferase assay was done 
as described previously**. The data were then plotted to determine the neutral- 
ization mediated by the antibodies in ‘wash’ or ‘no wash’ conditions. 

Structure determination and analysis. The antigen binding fragment of 10E8 
(Fab) was prepared using LysC digestion, as previously described**. The IgG was 
first reduced with 100 mM dithiothreitol (DTT) for 1 h at 37 °C, followed by 1h of 
dialysis in HEPES, pH 7.6, to reduce the DTT concentration to 1 mM. Antibodies 
were then dialysed against 2 mM iodoacetamide for 48 h at 4 °C, and subjected to a 
final dialysis against HEPES, pH 7.6, for 2 h. After reduction and alkylation, anti- 
bodies were cleaved with Lys-C (Roche). Fab was purified by ion exchange (Mono 
S) and size-exclusion chromatography (S200). Purified 10E8 Fab was incubated 
with tenfold excess peptide RRR-NEQELLELDKWASLWNWEDITNWLWYIR- 
RRR (American Peptide) and the complex then set up for 20°C vapour diffusion 
sitting-drop crystallizations on a Honeybee 963 robot. A total of 576 initial condi- 
tions adapted from the commercially available Hampton (Hampton Research), 
Precipitant Synergy (Emerald Biosystems) and Wizard (Emerald Biosystems) crys- 
tallization screens were set up and imaged using the Rockimager (Formulatrix), 
followed by hand optimization of crystal hits. Crystals grown in 40% PEG 400, 
0.1 M NaCitrate, 0.1 M Tris pH 7.5 diffracted to 2.1 A resolution ina cryoprotectant 
composed of mother liquor supplemented with15% 2R-3R-butanediol and excess 
peptide. After mounting the crystals on a loop, they were flash cooled and data 
collected at 1.00 A wavelength at SER CAT ID-22 or BM-22 beamlines (APS) and 
processed using HKL-2000*. Structures were solved through molecular replace- 
ment with Phaser**”’, using a previously obtained free structure of 10E8 as a search 
model. Refinement of the structure was undertaken with Phenix’, with iterative 
model building using Coot*!. The structure was validated with MolProbity”, yield- 
ing 97% and 99.8% of residues falling within most favoured and allowed 
Ramachandran regions, respectively. The structure was analysed with APBS* for 
electrostatics, Ligplot™ for direct contacts, PISA® for buried surface areas, and 
Isqkab (ccp4 Package”®) for r.m.s.d. alignments. Helical wheels were generated using 
the program Pepwheel (http://150.185.138.86/cgi-bin/emboss/pepwheel). All 
graphics were prepared with Pymol (PyMOL Molecular Graphics System). 
Assessment of binding affinities of 1OE8 and 10E8 variants to the gp41 MPER. 
Surface-Plasmon Resonance (SPR) (Biacore T200, GE Healthcare) was used to 
assess binding affinity of wild-type 10E8 to a gp41 MPER peptide. A biotinylated 
peptide composed of residues 656-683 of the gp41 MPER (RRR- 
NEQELLELDKWASLWNWEDITNWLWYIR-RRK-biotin; American Peptide) 
was coupled to a biacore SA chip to a surface density of 20-50 response units 
(RU). The 10E8 fragment of antigen binding (Fab) was then flowed over as 
analyte at concentrations ranging from 0.25nM to 125 nM, at twofold serial 
dilutions, with association and dissociation phases of up to 5 min, at a flow rate 
of 30 pl min’. The binding of the 2F5 and 4E10 Fab controls to the same peptide 
was examined under identical conditions. 

Binding affinities of the 10E8 paratope alanine mutants to the MPER were also 
assessed with SPR, but using an antibody capture method. A Biacore CM5 chip 
was amine-coupled with anti-human Fc antibody to high surface densities of 
~10,000 RU. The 10E8 paratope variant IgGs were then captured to between 
1,500-2,500 RU and a peptide composed of residues 656-683 of the gp41 MPER 
(RRR-NEQELLELDKWASLWNWEDITNWLWYIR-RRR) flowed over as ana- 
lyte at twofold serial dilutions starting at 500 nM (with the exception of HC D30A, 
WI100bA, S100cA, P100fA, which started at 250 nM). Association and dissoci- 
ation phases spanned 3 min and 5 min, respectively, at a flow rate of 30 pl min’. 
Binding sensograms were fit with 1:1 Langmuir models using Biacore 
BiaEvaluation Software (GE Healthcare). In all cases, Biacore HBSEP+ buffer 
was used (10 mM HEPES, pH 7.4, 150 mM NaCl, 3 mM EDTA, 0.1% P-20). 


PCR amplification and sequencing. Extraction of viral RNA from plasma and 
cDNA synthesis were performed as previously described”’. Single molecules of a 
588-bp fragment, encompassing the MPER region of the HIV-1 envelope gene, 
obtained through limiting dilution, were PCR-amplified with the Expand High 
Fidelity PCR System (Roche Applied Science) using the following primer sets: 
+7789 (sense) 5’-TCTTAGGAGCAGCAGGAAGCACTATGGG-3’ and —8524 
(antisense) 5'-GTAAGTCTCTCAAGCGGTGGTAGC-3’ in a first round reac- 
tion; +7850 (sense) 5’-ACAATTATTGTCTGGTATAGTGCAACAGCA-3’ 
and —8413 (antisense) 5’-CCACCTTCTTCTTCGATTCCTTCGG-3’ in a 
second round reaction. Each round of PCR consisted of 25 cycles, with the initial 
denaturation at 94°C for 2 min, followed 25 cycles of denaturation at 94°C for 
15s, annealing at 50°C for 30s, and extension at 72°C for 1 min, with the final 
extension at 72 °C for 7 min. The PCR products were purified with the QIA quick 
PCR purification kit (Qiagen), and then cloned into pCR2.1-TOPO vector 
(TOPO TA Cloning it, Invitrogen) for sequence analysis of individual molecular 
clones. The DNAs from 18 independent clones were sequenced with the ABI 
BigDye Terminator v3.1 Ready Reaction Cycle Sequencing kit (Applied 
Biosystems) and analysed with the ABI PRISM 3130xl Genetic Analyzer 
(Applied Biosystems). 

Statistical analysis. The relationship between the potency of N152 patient serum 
and 10E8, and the relationship between 10E8 variant binding and neutralization 
were evaluated by the Spearman rank method. 
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The Mu transpososome structure sheds 
light on DDE recombinase evolution 


Sherwin P. Montafio', Ying Z. Pigli’ & Phoebe A. Rice! 


Studies of bacteriophage Mu transposition paved the way for understanding retroviral integration and V(D)J 
recombination as well as many other DNA transposition reactions. Here we report the structure of the Mu 
transpososome—Mu transposase (MuA) in complex with bacteriophage DNA ends and target DNA—determined from 
data that extend anisotropically to 5. 2A, 5.2A and 3.74 resolution, in conjunction with previously determined 
structures of individual domains. The highly intertwined structure illustrates why chemical activity depends on 
formation of the synaptic complex, and reveals that individual domains have different roles when bound to different 
sites. The structure also provides explanations for the increased stability of the final product complex and for its 
preferential recognition by the ATP-dependent unfoldase ClpX. Although MuA and many other recombinases share a 
structurally conserved ‘DDE’ catalytic domain, comparisons among the limited set of available complex structures 
indicate that some conserved features, such as catalysis in trans and target DNA bending, arose through convergent 


evolution because they are important for function. 


Mobile DNA elements are important in many aspects of biology, such 
as disease, evolution and the spread of antibiotic resistance, and the 
recombinases they encode, including MuA, are useful genetic tools’? 
The DNA transposition system of bacteriophage Mu was the first to 
be developed in vitro’. MuA, many other DNA transposases, and 
retroviral integrases share a conserved RNaseH-like or DDE catalytic 
domain, named for the three Mg”*-binding carboxylates in their 
active sites*. Structural studies have lagged behind biochemical ones: 
only three family members have been co-crystallized in active, DNA- 
bound complexes, and only one with target DNA*’. Despite mech- 
anistic similarities, only the catalytic domain is conserved among all of 
these, and their overall architectures are completely different*. More 
examples are needed to understand the diverse ways in which these 
enzymes harness a common catalytic domain to accomplish trans- 
position. The richness of the known biochemistry for bacteriophage 
Mu, from assembly of the initial complex to targeted disassembly of 
the product complex, makes it a particularly informative example for 
structural studies. 


The first steps of Mu transposition (Fig. 1) are common to many other 
DNA transposition systems as well as retroviral integration: (1) pairing 
of the mobile element’s ends by the recombinase to form a ‘transposo- 
some’ or ‘intasome’; (2) hydrolytic nicking at the bacteriophage-host 
junction; and (3) attack of the newly freed 3’ hydroxyls ona target DNA 
(strand transfer), creating a new connectivity. Bacteriophage Mu uses 
this mechanism to form a lysogen, and to replicate when it becomes lytic. 
During the lytic phase, host enzymes are recruited to convert the 
branched product into replication forks, resulting in duplication of the 
entire bacteriophage genome. However, during initial lysogen forma- 
tion, the ‘flanking host’ DNA (grey in Fig. 1) consists only of extra 
sequences appended during packaging of the bacteriophage Mu DNA 
into bacteriophage capsids. In this case, a poorly understood signal 
causes the transposase to cleave both strands at each genome end, lead- 
ing to a simple insertion without replication®”. 

MuA is chemically active only when incorporated into transposo- 
somes that pair the two ends of the phage genome, and thus assembly 
of this complex is a regulatory step. Mu transpososomes become 
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Figure 1 | Transposition pathway and structure 
determination. a, Cartoon of transposition. The 
transposase (MuA) pairs the bacteriophage genome 
ends (blue and red). At each end, the same active site 
catalyses the attack of HO at the phage—host 
junction and then the direct attack of the phage 3’- 
OH on target DNA (‘strand transfer’). Target binding 
is nonspecific, and there is a 5-bp stagger between the 
sites of attack. Host and target DNAs may be entire 
circular replicons. After the ATP-dependent 
unfoldase ClpX disassembles the final strand transfer 
complex, the 3’ hydroxyls are used as replication 
primers, resulting in duplication of the bacteriophage 
genome. Our crystals contain the strand transfer 
product (third panel). b, Domain structure of MuA. 
c, Experimental electron density map after phase 


improvement with Parrot superimposed on the 
model (contours are 1.2 and 20). 


1Department of Biochemistry and Molecular Biology, The University of Chicago, Chicago, Illinois 60637, USA. 


15 NOVEMBER 2012 | VOL 491 | NATURE | 413 


©2012 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


increasingly stable as the reaction progresses'*"’. After strand transfer 
the complex is so stable that the enzyme MuA does not actually turn 
over. The strand transfer reaction can only be reversed if the complex 
is disrupted, for instance by heating to 75 °C (refs 12, 13). This may be 
a thermodynamic necessity for a reaction in which the first step 
(hydrolysis) is committed, yet the second step (strand transfer) is 
chemically isoenergetic, with no net change in the number of phos- 
phodiester bonds. In vivo, Mu transpososomes must be disassembled 
by the ATP-dependent unfoldase ClpX before DNA replication can 
proceed'*"’. From the transpososome structure it is possible to derive 
explanations for the increased stability of the final complex and its 
preferential recognition by ClpX. 

Crystallization of the strand transfer complex was facilitated by two 
observations. First, despite a lack of target sequence specificity, MuA 
attacks mismatch-containing target DNA with single-nucleotide pre- 
cision (Supplementary Fig. 1)’. Second, the natural transpososome 
assembly pathway (described below) can be simplified under permis- 
sive conditions in vitro. The resulting active complexes contain four 
copies of the MuA protein and two copies of a ~50-base-pair (bp) 
DNA derived from the bacteriophage genome’s right end, each car- 
rying two MuA binding sites (termed R1 and R2)*"”. Modelling based 
on the structure of these complexes suggests that transpososomes 
formed on full left and right ends are quite similar. 


Overall architecture 


Viewed in isolation, the five domains of each subunit resemble beads 
ona string (Supplementary Fig. 2). However, when all four subunits are 
assembled on the DNA, they intertwine to form a network of protein- 
protein and protein-DNA interactions (Fig. 2 and Supplementary 
Video 2). The overall transpososome resembles a pair of scissors, with 
the bacteriophage end DNAs forming the handles and the sharply bent 
target DNA the blades. A 34A resolution electron microscopy (EM) 
reconstruction of Mu transpososomes in the absence of target DNA 
found a similar V-shape, although the arms were shorter and the 
accompanying electron spectroscopic imaging predicted a more con- 
torted path for the DNA”. Within the transpososome, most of the 
individual protein domains perform different roles in the R1- versus 
R2-bound subunits. Where a system encoded by a larger genome 
might, during the course of evolution, use two separate polypeptides, 
bacteriophage Mu re-uses the same sequence to perform different 
functions within the complex. 


Catalysis and Mu DNA end-binding in trans 

The catalytic sites lie within domain IIx of MuA. In agreement with 
biochemical studies, only the Rl-bound subunits’ active sites engage 
with DNA, and they do so in trans: for example, the dark-blue subunit 
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binds the blue Mu DNA via domains If and y, whereas its active site 
domain docks at the red Mu DNA-target junction (the interdomain 
linkers are too short for any other connectivity) (Fig. 3). First cha- 
racterized for Mu transposition”, such trans catalysis is a recurring 
theme in DNA transposition, and helps to render chemical activity 
dependent on full complex assembly*. Domains I and IIB of the R2 
subunits bridge the two Mu end DNAs and have a primarily structural 
role. IIx of each R2 subunit interacts with the DNA-binding domain 
(DBD) If of the subunit bound to the R1 site of the same DNA seg- 
ment, whereas IIB of each R2 subunit binds to the opposite DNA 
segment. Biochemical studies had predicted domain IIf to interact 
with the target DNA, which does occur in the R1 subunits”. 

Domains If and Iy of MuA recognize the specific binding sites on 
the bacteriophage DNA ends and their positions agree with foot- 
printing and mutagenesis data**”’. The closest structural match to 
MuA’s tandem DNA binding domains is the centromere binding pro- 
tein CENP-B, which probably evolved from an ancestral transposase”®. 
The eukaryotic mariner family Mos1 and Tc3 transposase structures 
also include tandem DBDs’””’. In both, contacts between the DBDs 
equivalent to domain IB of MuA mediate synapsis of the two trans- 
poson ends. In the Mu transpososome, only the Rl-bound DBDs 
mediate protein-protein contacts: If as described above, and Iy to both 
Ila of the R1 subunit at the other end of the Mu bacteriophage and 
IIIa of the R2 subunit at the same end of the Mu bacteriophage. The 
importance of these interactions is underscored by the high sequence 
conservation within the interaction surfaces (Supplementary Fig. 3). 

Although the resolution precludes detailed analysis of DNA bending, 
the path of the backbone is clear. The R2 site is bent by ~28°, largely 
through compression of the major groove around domain IB, which 
agrees with DNase I hypersensitive sites and the CENP-B-DNA struc- 
ture'°*?®, The RI site is less bent (~17°). Stronger bending there 
would cause a steric clash between domain Iy of the Rl-bound subunit 
and the B-barrel (IIB) of the adjacent R2 subunit. We propose that the 
RI site straightens somewhat upon transpososome formation. The 
formation of favourable protein-DNA and protein-protein contacts 
within the transpososome could offset the cost of weakening contacts 
between DNA and domain IB. 

Larger DNA conformational changes may occur on transpososome 
formation, as indicated by solution experiments that found ~90° 
bends in monomeric MuA-DNA complexes”. In monomeric MuA, 
domain IIo could interact in cis with its own IB domain and additional 
DNA bending might be induced by electrostatic interactions with IIB 
and II«. A different but stable monomer conformation would raise 
the energy barrier to spontaneous tetramer formation, making it more 
amenable to regulation. It could also prevent premature encounters 
between the active site and the DNA. 


Figure 2 | Transpososome 
structure. The complex sits on a 
crystallographic two-fold symmetry 
axis (vertical) that relates the blue 
and red halves. The pale- and dark- 
coloured subunits adopt different 
conformations within the 
homotetramer. DNA colours match 
Fig. 1. a, Cartoon. Catalytic sites are 
marked as yellow and tan stars 
(facing the viewer or the background, 
respectively) and domains of the blue 
subunits are labelled. b, Ribbon 
drawing, with the scissile phosphate 
groups shown as yellow spheres. 

c, Same drawing as in b, rotated ~90° 
about a vertical axis. 
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Target DNA 


The target DNA is bent through a total of ~140°. Protein-DNA con- 
tacts are mediated by the R1 subunits’ domains Ilo, IIB and IIa and 
extend to all but the outermost base pairs of the target DNA, as pre- 
dicted by footprinting”. A long loop (residues 410-430) extends from 
each catalytic domain (IIx) underneath the target. This loop could be 
fit onto experimental electron density by a rigid-body rotation from its 
position in the 2.8 A domain Ilo structure (Protein Data Bank accession 
1bcm)**. The positively charged B-barrel, domain IIf, interacts loosely 
with the outer end of the target. Although poorly ordered, its position is 
defined by the SeMet signal from two adjacent Met residues. The 
relative orientation between domains IIo and IIB of the R1 subunit is 
shifted slightly from that in the unbound protein and in the R2 subunit. 
This alters the connecting loop, which also lies near the target, but 
could not be modelled. Finally, the domain Ilus of the R1 subunits 
pair to form a coiled coil on the concave side of the bent target DNA. 


Two roles for domain IIIa 

Domain IIx, the final ~45 residues in our structure, also has different 
roles in the Rl compared with the R2 subunits. It is highly positively 
charged and binds DNA as an isolated peptide*’. The domain Ilus of 
the R1 subunits appear to stabilize the bent target DNA in two ways: (1) 
alleviating electrostatic repulsion between the sides of the U-shaped 
target; and (2) physically trapping the target DNA within the complex. 
In the absence of target DNA, they must be either mobile or in a 
different location. As the carboxy terminus of MuA contains a ClpX 
binding tag, rearrangement of the R1 subunits’ domain IIIs upon target 
binding could allow ClpX to preferentially recognize the final strand 
transfer complex for ATP-dependent disassembly. Furthermore, it is 
these end-most subunits that ClpX preferentially unfolds*?**. 

Initial transpososome assembly requires domain IIx on the R2- 
bound, but not the Rl-bound, subunits****”*. The structure suggests 
that the R2 subunits’ domain IIus stabilize the complex by wrapping 
around the other subunits near the active site. The domain IHIas may 
also anchor the flanking host DNA (Figs 1 and 3). The construct 
crystallized included minimal flanking host DNA, but if extended, it 
could bind the R2 subunit’s IIx domain, occupying a spot where 
symmetry-related DNA interacts in the crystal. This model agrees with 
footprinting data that predicted a large distortion, and would prevent 
steric clashes between the flanking host and target DNAs'®”*. Domain 
IIIa was reported to have cryptic nuclease activity that might cleave the 
flanking DNA flap after the initial insertion reaction*”’. Alternatively, 
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Figure 3 | Stereo close-up view of 
interactions near the Mu DNA- 
target junction. Colours are the 
same as in Fig. 2. A segment of DNA 
from a symmetry-related complex 
(yellow) binds the positively charged 
domain IIe of the R2-bound subunit 
(cyan). If the red Mu end DNA were 
extended to include flanking host 
DNA, it could lie where the yellow 
DNA does. The yellow sphere marks 
the phosphate group at the Mu- 
target DNA junction, and the main 
chains of the two active site D 
residues are also yellow (a third active 
site residue lies on a helix that could 
not be modelled). The loop that 
extends from domain IIx (~amino 
acids 410-430) to interact with the 
black target DNA is circled on the red 
subunit. 


movement of domain IIIx (triggered by an unknown signal) might 
deliver the uncleaved strand to the DDE site for hydrolysis. 


Transpososomes with full left and right ends 

Although the complex that we crystallized is highly active in vitro, its 
assembly requires high protein and DNA concentrations or ‘permissive’ 
solvent conditions'*. The natural system is more complicated and 
provides an interesting example of templated complex assembly. The 
two ends of bacteriophage Mu genome carry different arrays of three 
MuA binding sites each, and the left end also binds the DNA-bending 
protein HU (Fig. 4). Conversion of an initial pairing of right and left 
ends to an active complex is stimulated by transient binding of the 
amino-terminal domains of several MuA subunits to an internal 
enhancer element*””*. However, if complexes assembled this way are 


Figure 4 | Model for a transpososome assembled on full left (reddish) and 
right (blue) bacteriophage ends. The N terminus of each domain If is marked 
with a red sphere to show the approximate position of domain Ix, which 

transiently binds the enhancer. Domains discussed in the text are labelled. Inset: 
cartoon of the bacteriophage Mu genome ends and internal enhancer element. 
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treated with a high salt wash, an active complex remains that contains 
only four subunits of MuA, contacting only the R1, R2 and LI DNA 
sites*””. Thus, the other binding sites are important for assembly but 
not for the final activity. 

Modelling showed that the functional part of the tetrameric assembly, 
where Mu ends join to target DNA, can be identical in the crystallized 
(R1R2), complex and in the full left plus right complex (Fig. 4). Subunits 
R1, R2, L1 and domains II and III of the L2 subunit were modelled 
directly from the crystal structure. The HU-induced bend allows a single 
protomer of MuA to bind the L2 site via domain I while its domains II 
and III form part of the core complex. Domain [7 acts as a linker, which 
explains why L2 is the only site where Iy doesn’t bind DNA”. 

Additional interactions involving the L3- and R3-bound subunits 
may temporarily hold the components together while the intertwining 
of protein and DNA needed to form a transpososome at the Mu-host 
junction occurs. We modelled interactions between the L3 subunit’s 
domain IIx and R2’s IB based on those seen in the crystal between the 
R2- and Rl-bound subunits. The R3 subunit’s role is unclear, but if the 
HU-induced bend were relaxed, similar cross-end interactions might 
occur between the L3 and R3 subunits. 

Topological studies predicted two right-handed superhelical cross- 
ings within the transpososome*”’. One such crossing occurs in the 
crystal structure, near the junction with target DNA (R1 over L1 in 
Fig. 4). A second crossing (L3 over R2) results from the severe bend 
induced by HU in the model, in conjunction with a smaller bend in 
the L2 site. Restraint of two supercoils within such short segments of 
DNA is consistent with observations that supercoiling stimulates 
transpososome assembly*'. 

During assembly on intact bacteriophage DNA, domain Ie transi- 
ently binds an internal enhancer, which contains two clusters of MuA 
binding sites termed O1 and O2. There are too many degrees of free- 
dom to add the enhancer to our model. However, our model does agree 
with data showing that Ol-bound proteins interact with both L3 and 
R1 and that proteins bound to the somewhat longer O2 may contact 
both R3 and L1**”. The multiple protein-protein interaction surfaces 
of MuA may help to stabilize an initial pairing of the bacteriophage 
ends, but interactions between the wrong partners could slow down the 
transition to a final, cleavage-ready complex. The enhancer may stimu- 
late this transition by preventing unproductive interactions among L- 
and R-bound subunits as well as by aligning them for productive ones. 

Such an elaborate assembly process is not limited to Mu: many other 
mobile elements also require seemingly ‘extra’ recombinase subunits”. 
Although the details vary among these systems, they may all be using 
the same fundamental strategy of using additional subunits to tem- 
porarily stabilize pairing of the element end DNAs while a complicated, 
intertwined structure forms at the 3’ ends. The additional complexity 
may also provide additional opportunities for regulation. Finally, for 
mobile elements that are present in high copy, it may help to ensure that 
the two ends paired in a single transpososome belong to the same copy 
of the mobile element. 


Convergent and divergent evolution 
Many DNA transposases and retroviral integrases share a structurally 
conserved DDE or RNaseH-like catalytic domain, indicative of diver- 
gence from a common ancestor. However, this is the only domain 
conserved among these diverse recombinases. Comparison of the four 
reported structures of DDE recombinases in complex with substrate 
DNAs shows that other recurring features may reflect convergent 
evolution for functional reasons (Fig. 5). All four complexes are held 
together by intertwined networks of protein-protein and protein- 
DNA contacts, although different domains mediate those contacts*. 
Mos! and MuA do have structurally related bipartite DNA-binding 
domains, but even those domains form different protein-protein 
contacts in their respective transpososomes’. 

Despite the diversity of these complexes, catalysis is always in trans: the 
subunit that catalyses DNA cleavage and joining on one mobile element 
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Figure 5 | Comparison of DDE recombinase-DNA complexes. The mobile 
element ends are red and blue, and target DNA (where included) is black. 
Subunits that carry out the chemical reactions are red and blue; additional 
subunits are pink and cyan. Active site residues, scissile phosphate groups, and 
the two B-strands of the conserved catalytic domain that carry the catalytic D 
residues are in yellow. Mos] is a Tc1/mariner family eukaryotic DNA 
transposon; Tn5 is a bacterial DNA transposon; and PFV is a mammalian 
retrovirus” ’. Mos] and Tn5 require only a dimer for activity, whereas Mu 
transposase and PFV integrase require tetramers. In the PFV structure, only the 
catalytic domains of the additional subunits were visible (pink and cyan). 


end binds to specific sequences on the other end. This feature ensures that 
the chemical reactions at the two element ends are coordinated, because 
the complex requires proper pairing of the ends for assembly. 

Another recurring feature is strong bending of the target DNA. 
Target DNA bound by the prototype foamy virus (PFV) intasome 
is also bent, although not quite as severely as that in the Mu transpo- 
sosome®. The Mos] transpososome was crystallized without target 
DNA, but additional end DNAs found in the crystal bind where target 
is expected to, and in a way that requires target bending’. Modelling of 
target DNA onto the Tn5 transpososome structure also requires 
bending, which agrees with biochemical data for the related Tn10 
system***. Outside of the catalytic domain, contacts to the target 
DNA vary widely among these structures. Why then have they all 
evolved to strongly bend the target DNA? As noted for the PFV 
structure, target bending may help render strand transfer irreversible 
by straining the DNA conformation such that the ends snap away 
from the active site after strand transfer. This may be a source of 
the product binding energy that drives forward the otherwise- 
isoenergetic strand transfer reaction. The overall conformation of 
the target DNA in the Mu transpososome resembles that bound by 
integration host factor (IHF). In that case, a nick at the kink does 
enhance affinity by allowing the ends to spring apart*®. 

The DDE catalytic domain is thus a conserved module that has 
been co-opted by numerous mobile elements to perform similar 
chemical reactions. However, other similarities in the way that it 
has been harnessed to mobilize these elements seem to reflect con- 
vergent evolution to satisfy functional requirements. 


METHODS SUMMARY 


We determined the structure of the final strand transfer complex, which contains a 
tetramer of MuA, two copies of the bacteriophage end DNA, and one target DNA 
(Supplementary Fig. 1 and Supplementary Video 1). Crystallizations used a slightly 
truncated protein, MuA(77-605), which is active in vitro and lacks only the 
N-terminal enhancer-binding domain and the C-terminal domain that interacts 
with ClpX and MuB, a second bacteriophage-encoded protein that helps deliver an 
appropriate target DNA under non-permissive conditions*’. The bacteriophage 
end DNA mimics pre-cleaved right ends, and the 35-bp target DNA contains a 
central GeG mismatch. Although MuA displays little sequence specificity for target 
DNA, it attacks mismatch-containing DNA with single-nucleotide precision’. This 
feature facilitated production of a homogeneous sample for crystallization. 
Phases were determined by MIRAS using three derivatives. The crystals dif- 
fracted anisotropically to 5.2 A, 5.2 A and 3.7A along the three principal axes. 
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Model building was possible despite the low resolution because >90% of the 
protein structure had been previously determined as isolated domains****”’. 
Placement of the protein domains was verified by SeMet data, and the DNA 
sequence register by an additional data set collected from crystals where every 
T on one strand had been substituted with 5-bromodeoxyuridine (BrdU) 
(Supplementary Fig. 4). No previous structure was available for domain IIIa, 
which comprises ~45 amino acids that are strongly predicted to form one long 
helix followed by a short one. Although density for these helices was visible, the 
sequence register is uncertain. The complex lies on a crystallographic two-fold 
axis such that the asymmetric unit contains half a transpososome. After highly 
restrained refinement, R and Ry... values were 39.3% and 43.7%, well within the 
range expected for a low-resolution structure. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Overview. We determined the structure of the final strand transfer complex, 
which contains a tetramer of MuA, two copies of the bacteriophage end DNA, 
and one target DNA (Fig. 2 and Supplementary Video 1). Crystallizations used a 
slightly truncated protein, MuA(77-605), which is active in vitro and lacks only the 
N-terminal enhancer-binding domain and the C-terminal domain that interacts 
with ClpX and MuB, a second bacteriophage-encoded protein that helps deliver an 
appropriate target DNA under non-permissive conditions”. The bacteriophage 
end DNA mimics pre-cleaved right ends, and the 35-bp target DNA contains a 
central GeG mismatch. Although MuA displays little sequence specificity for target 
DNA, it attacks mismatch-containing DNA with single-nucleotide precision’. 
This feature facilitated production of a homogenous sample for crystallization. 

Phases were determined by MIRAS using three derivatives. The crystals dif- 
fracted anisotropically to 5.2 A, 5.2 A and 3.7 A along the three principal axes. 
Model building was possible despite the low resolution because >90% of the 
protein structure had been previously determined as isolated domains****”. 
Placement of the protein domains was verified by SeMet data and the DNA 
sequence register by an additional data set collected from crystals where every T 
on one strand had been substituted with BrdU (Supplementary Fig. 4). No previous 
structure was available for domain IIo, which comprises ~45 amino acids that are 
strongly predicted to form one long helix followed bya short one. Although density 
for these helices was visible, the sequence register is uncertain. The complex lies on 
a crystallographic two-fold axis such that the asymmetric unit contains half a 
transpososome. After highly restrained refinement, R and Rgee values were 
39.3% and 43.7%, well within the range expected for a low-resolution structure. 
Expression and purification of the MuA transposase. The pMK599 plasmid, a 
pET3c derivative that contains the MuA open reading frame coding for residues 
77-605, was a gift from the Mizuuchi laboratory”’. This plasmid was transformed 
into Escherichia coli Rosetta pLysS strain (EMD Biosciences) for protein over- 
expression. After plating transformants, a starter culture was prepared by inocu- 
lating multiple colonies into LB media (with 100 pg ml” * ampicillin) and growing 
at 37°C until the absorbance at 600 nm (Agoo) was ~0.7. Typically, 100 ml of 
starter culture was prepared per litre of final culture. After the addition of starter 
culture to fresh ampicillin-containing LB media, cells were grown to Agoo of ~0.8, 
then protein expression was induced with IPTG (added to a final concentration of 
0.5 mM). Cells were collected 2 h after induction by centrifugation at ~8,000g for 
10 min, and cell pellets were stored at —80 °C for later use. 

Cell pellets were re-suspended in a lysis buffer (25mM HEPES (pH7.50), 
1mM EDTA, 1M NaCl, 10% sucrose, 10% glycerol, 5mM DTT, 200 pg ml! 
lysozyme, protease inhibitor cocktail from Roche Diagnostics), sonicated, and 
centrifuged at 40,000g for 1 h (18,000 r.p.m. in a SS-34 rotor). Ammonium sul- 
phate was added to the supernatant to 30% saturation to precipitate the protein. 
The pellet was collected by centrifugation, and redissolved in buffer A (20 mM 
MES (pH5.5), 0.5mM EDTA, 5% glycerol, 0.2 M NaCl and 1mM DTT). The 
protein sample was filtered before loading onto a heparin affinity column (GE 
Healthcare). Proteins were eluted with salt gradient from 0.2 M to 2.0 M NaCl. To 
improve the purity, MuA-containing fractions were rechromatographed on 
heparin after dialysis into buffer A. The protein was then dialysed into buffer A 
again and loaded onto a Mono-S column (GE Healthcare). A gradient similar to 
that from the heparin affinity purification was applied. Fractions containing MuA 
were pooled and dialysed at 4°C into 20mM HEPES (pH7.5), 0.5mM EDTA, 
0.2M ammonium sulphate, 20% glycerol, and 1mM DTT. The protein was 
concentrated to approximately 10mg ml’, and stored at —80°C. Minimal 
nuclease contamination was detected when samples (0.5 mg ml ' final concen- 
tration) were incubated for 2 h at 37 °C in 10 mM HEPES pH 7.5 with supercoiled 
plasmid DNA, 50 mM NaCl and 10 mM MgCh. 

SeMet-labelled MuA(77-605) was prepared similarly except that cells were 
grown differently*': Instead of using LB, the cells were inoculated in M9 media 
plus 0.4% glucose, 10 mM NaCl, 0.1 mM CaCh, 2mM MgSO, and 100 mg ml 
ampicillin until Agog reached ~0.5. An amino acid cocktail containing 
L-isoleucine, L-leucine, L-lysine, L-phenylalanine, L-threonine and L-valine was 
then added to a final concentration of 100 mg of each amino acid per litre. Seleno- 
pi-methione (Sigma) was also added to a final concentration of 60 mg! *. The 
culture was grown for 15 more minutes before 0.5 mM IPTG was added. Cells 
were collected after 3h of induction. 

Preparation of Mu end and target DNA. Mu end DNA duplexes were designed 
to contain the R1 and R2 binding sites for MuA. Each duplex was prepared by 
mixing four single strands in equal molar amounts. The oligonucleotides used for 
the structure determination are listed below: TL, 5’-GCTTGAAGCGGCGCA 
CGAAAAACGCG-3’; TR, 5'-AAAGCGTTTCACGATAAATGCGAAAAC-3'; 
BL, 5'’-AACGCTTTCGCGTTTTTCGTGCGCCGCTTCA-3’; BR, 5’-CGGTT 
TICGCATTTATCGTGA-3’. These strands were heated at 80°C for 20 min, 
and annealed by slow cooling to room temperature. The final concentration of 


the duplex DNA is 0.2 mM in TEN buffer (10 mM Tris-HCl and 0.5 mM EDTA, 
100 mM NaCl, pH 8.0). The resulting DNA mimics the product of initial DNA 
cleavage by MuA, and has a three-nucleotide 5’-overhang on the uncleaved 
strand, and a two-nucleotide 5'-overhang on the other end. Each strand of the 
resulting duplex is nicked at a position that does not interfere with transpososome 
assembly. 

The target DNA contains a central mismatch and was designed to be asym- 
metric to avoid hairpin formation during annealing. The target DNA was pre- 
pared in a manner similar to that of the Mu end DNA using the following 
oligonucleotides: (1) 5’-TATCGCAACAACACATCGGATAACCATAAGTAA 
TA-3’; (2) 5’-TATTACTTATGGTTATCGGATGTGTTGTTGCGATA-3’. 

All unmodified oligonucleotides were obtained from IDT Technologies. 
Brominated oligonucleotides (discussed below) that were used in this study to 
validate the sequence of the donor DNA and the location of the target DNA were 
obtained from Yale University’s Keck Facility. Because brominated oligonucleo- 
tides are photolabile, they were handled in the dark as much as possible. 
Crystallization and data collection. Strand-transfer complexes were assembled 
by mixing the target DNA, Mu end DNA, and MuA protein in 1:1.4:3.7 molar 
ratios in a solution containing 10 mM MgCl, 25 mM HEPES (pH 7.5), 10 mM 
DTT, 0.02% Zwittergent, 14% glycerol, and 0.2 M (NH4)2SO4. This was incubated 
for at least 1 h at room temperature to ensure completeness of the strand transfer 
reaction. Although DMSO is usually added to stimulate assembly of transposo- 
somes with two right end DNAs, we found that it was not necessary at the high 
protein and DNA concentrations used for crystallographic work"*. Crystallization 
trials were then performed using the hanging-drop vapour diffusion method: 
drops contained a 1:1 mixture of complex stock solution (~2 mg ml ') and well, 
and were incubated at 19 °C. Crystals appeared in 22-28% (v/v) PEG400, 0.1M 
HEPES (pH 7.50), and 0.2M MgCl, and grew to their full size in 2-4 weeks. 
Tantalum bromide derivatives were obtained by soaking the crystals of the 
strand-transfer complex for 1-8 days with 0.4mM [TagBr,2]** cluster (Jena 
Bioscience) in a solution that mimicked the condition of the drop. For derivatiza- 
tion with mercury, crystals were soaked in 32% PEG400, 0.1 M HEPES (pH 7.5), 
0.2 M MgCl and 0.1 mM mersalyl acid (Sigma) for 1-2 days. For derivatization 
with selenium, crystallization setups were done with SeMet protein. And for that 
with bromine, the brominated donor DNA where every thymine on the T-rich 
strand (oligos BL & BR) was replaced with BrdU was used. All crystals were frozen 
in liquid N2 directly from the drop. 

Numerous data sets were collected from several different beamlines at the 

Advanced Photon Source in Argonne. Many crystals were screened at BIO- 
CARS 14-BM. All data sets used for the final phasing and refinement were 
collected at SBC-CAT 19-ID beamline at 100K temperature. For the SeMet data, 
data sets collected from two different crystals were merged to improve complete- 
ness of data, especially at the low-resolution shells. X-ray data collected from the 
native and Ta-derivative crystals were integrated and scaled with HKL3000 suite 
and the others with HKL2000°. A summary of the data collection statistics is 
shown in Supplementary Table 1. 
Structure determination and refinement. The toehold in solving the structure of 
the Mu transpososome was a single Tantalum bromide cluster (Supplementary 
Table 1). This cluster was initially found using direct methods in SHELXD™ from 
a 5-day-soaked Ta-data set, and was consistent with the anomalous difference 
Patterson maps™ generated from other Ta-data sets where crystals were soaked 
for 1, 3, 5, 7 and 8 days. SIRAS phases from this one cluster were generated using 
MLPHARE and were used in anomalous difference Fourier methods to deter- 
mine the substructure for the mercury derivative’. We used SIRAS phases cal- 
culated from the Hg derivative to confirm independently the Ta site. After several 
rounds of difference Fourier calculation, we were able to locate the rest of the 
heavy atoms. Final MIRAS phases were generated from 4 [Ta.Brj.]?*, 3 Hg and 
17 Se sites. Reasonable figures of merit were obtained before density modification: 
0.41 for centric and 0.25 for acentric reflections. With 77% solvent in the crystal, 
further phase improvement was achieved by density modification using Parrot”. 
Electron density maps generated showed clear density for the DNA as well as 
tubular densities that represent protein helices. The protein structure was initially 
modelled by docking previously determined structural domains (IB, Iy and cata- 
lytic domains) of the MuA transposase into the density using Se peaks as markers. 
The bromine sites, despite not being included in calculating phases due to the low 
resolution of that data set, were particularly useful in guiding the model building 
for the donor DNA. We also have a low-resolution data set from a crystal that 
contains a symmetric brominated target DNA: 5’-TATCGCAACAACACA 
TCGGATGTGTTGTTGCGATA-3’. Bromine peaks obtained from this par- 
ticular crystal was useful in confirming the location of our target DNA 
(Supplementary Fig. 4). 

The transpososome lies on a crystallographic two-fold axis such that the asymmetric 
unit comprises half of a transpososome: two MuA protomers, one Mu end DNA, and 
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half target DNA. The initial model revealed possible loose contacts between two 
crystallographically related copies of domain IIB (a B-barrel). To improve the diffrac- 
tion of our crystals, we engineered three sets of mutations (Quikchange, Stratagene) in 
that region of contact: a single M521W mutation and two double mutations, M521W/ 
N525L and M521L/N525L. The latter led to an improvement of the resolution along 
the best diffracting axis of our ‘native data set’ from 4.2 to 3.7 A. 

The structure was modelled in COOT”, and refined using PHENIX™. Density 
was visible for several sections that unfortunately could not be modelled due to 
the resolution: for example, the linker between domains If and Iy lies in the 
minor groove as seen for Mos] and CENP-B, and the region around the third 
active site residue of the Rl-bound subunits, which clearly changes conformation 
from the inactive form seen in isolated domain II structures. Multiple restraints 
were used during refinement due to the low resolution of the data. These include 
H-bond restraints on the DNA base pairs, secondary structure restraints, model 
restraints where models of the individual domains of the MuA transposase were 
obtained from the PDB, NCS restraints, and Ramachandran restraints. Nine TLS 
groups were used: (1) R1B and the DNA with which it is interacting; (2) R2B and 
DNA; (3) Rly and DNA; (4) R2y and DNA plus R2-domain IIo; (5) R2-domain 
II including the B-barrel; (6) R1-domain II without the B-barrel; (7) R1-B-barrel; 
(8) R1-domain IIIa; and (9) target DNA including the sequences after the CA step 
in the donor DNA (that do not interact directly with the DNA binding domains). 
As categorized by PROCHECK, the percentage of residues in the following regions 
of the Ramachandran plot were: favoured/allowed/generous/disallowed = 93.8/ 
5.4/0.2/0.5. Several variations on this protocol were tried. Simply removing the 
Ramachandran restraints made very little difference, probably because most of 
the model was already restrained to previously determined domain structures. The 
unrestrained Ramachandran plot statistics were: favoured/allowed/generous/dis- 
allowed = 90.1/9.0/0.4/0.5, and Rwork/Rfree Were 40.1/43.6% (as opposed to 39.3/ 
43.7% with restraints). Superimposing the two structures (refined + Rama 
restraints) revealed some slight differences in residues 347-356. This is the region 
where Ramachandran outliers were observed. However, upon inspection of the 
experimental map and the difference maps, it was difficult to discern which was 
more correct. Hence, we are choosing to report a structure that has better geo- 
metry. We also tried refinement with DEN”, but it only improved the Ree. by 0.3% 
and greatly degraded the Ramachandran plot. Our structure may be an unusual 
test case for DEN because of the low resolution of our data and the high quality of 
our individual domain models. 

During the initial rounds of refinement, progress stalled when R¢-ee Was ~49%. 
However, after ellipsoidal truncation and anisotropic scaling were performed on 
the native data set using the Diffraction Anisotropy server®, the Rg,ee considerably 
improved to ~44%. The server truncated the data set to 3.7 A along c* and 5.2 A 
along a* and b*. The final refined structure has an Rwork and Rfree of 39.30% and 
43.70%, respectively. 

Modelling the full transpososome. To model the full complex, additional model 
B-form DNA coordinates was created using the W3DNA server*’. DNA and 
protein coordinates were manipulated in both pymol and coot. Subunits R1, 
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R2, L1 and domains II and III of the L2 subunit in the model could be taken 
directly from the crystal structure. To model the other subunits, we docked 
domain If of the R2 subunit and the DNA segment it binds onto the appropriate 
site in the modelled DNA. The L1 and L2 binding sites are separated by an 
~80-bp segment where the DNA bending protein HU binds. Modelling of the 
HU-induced bend was based on the structure of a closely related IHF-DNA 
complex and on footprinting data for HU synergistically bound within this 
loop®*®*. Modelled DNA for the L end was broken and appropriate sections 
abutted to the ends of the DNA in the IHF-DNA structure. We justified some 
additional bending of the DNA on the L2 end of the HU site based on the 
symmetry-related DNA in the IHF structure, and the fact that IHF- and HU- 
induced bends are known to be flexible. Bending of the model DNA in the L2 
binding site was based on bending seen crystallographically in the R2 site. In 
modelling the R3-bound subunit, the other domains simply followed I as a rigid 
body, which gives only a rough placement of domain II. For the L3-bound sub- 
unit, we modelled an interaction between its domain II and the R2 subunit’s 
domain If based on the II-If interactions seen in the crystal. 

Figures were prepared using Pymol (The PyMOL Molecular Graphics System, 
Version 1.3). 
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A primordial origin for misalignments between 
stellar spin axes and planetary orbits 


Konstantin Batygin'* 


The existence of gaseous giant planets whose orbits lie close to their 
host stars (‘hot Jupiters’) can largely be accounted for by planetary 
migration associated with viscous evolution of proto-planetary 
nebulae’. Recently, observations of the Rossiter-McLaughlin effect” 
during planetary transits have revealed that a considerable fraction 
of hot Jupiters are on orbits that are misaligned with respect to the 
spin axes of their host stars’. This observation has cast doubt on the 
importance of disk-driven migration as a mechanism for producing 
hot Jupiters. Here I show that misaligned orbits can be a natural 
consequence of disk migration in binary systems whose orbital 
plane is uncorrelated with the spin axes of the individual stars*®. 
The gravitational torques arising from the dynamical evolution of 
idealized proto-planetary disks under perturbations from massive 
distant bodies act to misalign the orbital planes of the disks relative 
to the spin poles of their host stars. As a result, I suggest that in the 
absence of strong coupling between the angular momentum of the 
disk and that of the host star, or of sufficient dissipation that acts to 
realign the stellar spin axis and the planetary orbits, the fraction of 
planetary systems (including systems of ‘hot Neptunes’ and ‘super- 
Earths’) whose angular momentum vectors are misaligned with 
respect to their host stars will be commensurate with the rate of 
primordial stellar multiplicity. 

The obliquities (angles between the planetary orbits and the stellar 
spins) of detected planetary orbits range from almost perfectly aligned 
prograde to almost perfectly aligned retrograde systems’. Previously, 
the misalignment between planetary orbits and stellar spin axes had 
been attributed to post-nebular multi-body interactions. Most notably, 
Kozai cycles with tidal friction*°, planet-planet scattering'’’”, and 
chaotic secular excursions’* have been invoked as a means of producing 
misaligned planets. These mechanisms are probably responsible for 
a few specific examples (for example, the extreme eccentricity of 
HD80606b is almost certainly due to Kozai resonance with the stellar 
companion HD80607*). However, it is unlikely that they can explain 
misaligned hot Jupiters as a population. For instance, the Kozai mecha- 
nism can be stifled by forced apsidal precession in multi-planet sys- 
tems'’. Likewise, within the context of planet-planet scattering and 
secular chaos, the allowed parameter range is limited, because the 
production of close-in orbits requires the timescale for tidal capture 
to be considerably shorter than that for eccentricity growth’’, while 
demanding the associated tidal heating to be small enough not to over- 
inflate the planet beyond its Roche lobe’. Additionally, the observed 
presence of mean-motion resonances among giant planets on wide 
orbits (which rely on smooth, convergent migration to congregate’*) 
provides further motivation for the development ofa unified model for 
disk migration that is capable of producing misaligned orbits. 

The dynamics of self-gravitating proto-planetary disks under external 
perturbations can be extremely complex, making precise quantitative 
modelling computationally unfeasible. Consequently, here I shall con- 
centrate on characterization of the qualitative physical behaviour of 
the system and use classical perturbation methods to obtain a solution. 
In the spirit of secular theory'®, I model the proto-planetary disk as a 


series of initially planar, circular, concentric, massive wires that inter- 
act gravitationally. Our model is based on the Gaussian averaging 
method'”’* and the gravitational potential is softened to partially 
account for the discrete representation of the disk. The effects of dissi- 
pative fluid forces within the disk are neglected. The perturbing body is 
also modelled as a massive ring, but is eccentric (e’ = 0.5) and inclined 
with respect to the disk by an inclination i’. A description of the model 
and its inherent assumptions is presented in ref. 19, and the details of 
our implementation are stated in the Supplementary Information. 

A self-gravitating disk will preserve an untwisted structure and act 
as a rigid body, provided that the characteristic timescale of the exter- 
nal perturbation greatly exceeds that of the disk’s self-interaction”. 
Mathematically, this amounts to a statement of adiabatic invariance 
of the phase-space area occupied by a single secular cycle within the 
disk’'. If this condition is satisfied, the external perturber’s sole effect is 
to induce a recession (that is, a retrograde drift) of the ascending node 
of the disk, as defined by the plane of the stellar orbit. The embedded 
planetary orbit will also adiabatically follow the disk. 

In the reference frame of the host star, the nodal recession of the disk 
will appear as a cyclic excitation of inclination between the disk and the 
stellar spin axis (see Fig. 1), provided that the host star’s angular 
momentum vector does not adiabatically trail the disk. For this to hold 
true, the characteristic interaction timescale between the disk and the 
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Figure 1 | Geometrical set-up of the problem. This figure depicts a schematic 
representation of the production of misaligned close-in planets through disk- 
driven migration in binary systems. The adiabatic response of a self-gravitating 
disk to long-term perturbations by a stellar companion is the recession of its 
ascending node, as defined by the orbital plane of the stellar companion. The 
recession of the disk’s angular momentum vector about the stellar binary 
orbital angular momentum vector appears to be an excitation of the 
misalignment angle y between the stellar spin axis and the disk in the star’s 
reference frame. 


1Division of Geological and Planetary Sciences, California Institute of Technology, Pasadena, California 91125, USA. Institute for Theory and Computation, Harvard-Smithsonian Center for Astrophysics, 60 


Garden Street, Cambridge, Massachusetts 02138, USA. 


418 | NATURE | VOL 491 |15 NOVEMBER 2012 


©2012 Macmillan Publishers Limited. All rights reserved 


stellar spin-axis, T sar, must exceed the disk’s nodal recession time- 
scale, T disk, by a considerable amount (that is, angular momentum 
coupling between the disk and the host star must be non-adiabatic)’. 
The former can be estimated by modelling the stellar rotational bulge 
as an inertially equivalent orbiting ring, effectively reducing the cha- 
racteristic interaction timescale to the forced nodal recession period of 
the ring. 

Observations suggest that rotational periods of T Tauri pre-main 
sequence stars, whose masses exceed M>0.25Mo, where Mo is the 
mass of the Sun, form a bimodal distribution where fast and slow rota- 
tors are centred around 2 days and 8 days respectively, with a preference 
for slow rotation at higher masses”. Thus, for typical pre-main-sequence 
stars, we obtain T star~10 Myr and T gtar~0.3 Myr for slow and fast 
rotators, respectively (see Supplementary Information for details). As 
will be shown below, this suggests that the adiabatic trailing of the stellar 
spin axes will only prevent excitation of mutual misalignment for fast 
rotators. Furthermore, the transfer of angular momentum from the disk 
to the star will probably be unimportant in mature disks because of low 
accretion rates”. 

In addition to avoiding the adiabatic trailing of the host stars, the 
prominence of the mechanism described here is determined by the 
abundance and longevity of wide stellar binary systems in star-formation 
environments, because the misalignment angle, i, becomes fixed when 
the binary companion is stripped away or when the proto-planetary disk 
dissipates (although a long-lived binary companion can act to misalign a 
mature planetary system with its host star~*). It is tempting to estimate 
the frequency with which significant misalignments are attained by this 
mechanism via population synthesis, but the enormous range and vast 
observational uncertainties in the input parameters would render such a 
calculation of little practical use. In particular, although observations of 
the Taurus—Auriga star-forming region” suggest that the orbital distri- 
bution of young solar-type binaries is roughly log-flat with an overall 
binary fraction of around 40%, it is noteworthy that the process of wide 
binary formation also appears to exhibit environmental dependence’. 
Simultaneously, the rate at which wide binaries get disrupted in birth 
clusters depends sensitively on the local densities within the clusters’, 
which remain observationally elusive, because the majority of stars are 
born in aggregates that dissolve quickly (on timescales of a few tens of 
millions of years or less)”°. Nevertheless, the above conditions probably 
imply that the timescale for excitation of significant misalignment 
should be considerably less than 10 Myr or so. 

In systems where self-gravity is strong enough to maintain the 
effective rigidity of the disk, fast circulation of the disk’s argument of 
perihelion also ensures adiabatic eccentricity dynamics. Specifically, 
this means that the disk will not develop significant eccentricity, and 
the Kozai resonance within the individual annuli will be suppressed”® 
(see Supplementary Information). Additionally, recall that here I am 
ignoring dissipative effects that would generally act to circularize the 
disk and maintain its rigidity. Assuming that the angular momentum 
of the stellar binary greatly exceeds that of the proto-planetary disk, the 
maximal inclination that can be excited between the stellar rotation 
axis and the disk’s orbital plane is approximately equal to twice the 
inclination of the proto-planetary disk with respect to the binary orbit: 
Winax~2i. As a result, retrograde planetary orbits can be naturally 
achieved, provided that the inclination of the stellar companion 
exceeds 45°. Figure 2 shows the time evolution of two such examples. 

Although the Gaussian averaging model used above yields a rigorous 
representation of the secular evolution of the system, it is also compu- 
tationally expensive. Fortunately, similar results can be obtained with 
a modified (arbitrary i’, e’) Laplace-Lagrange analytical theory’® 
(see Supplementary Information), providing an avenue for efficient 
mapping of parameter space. We have quantified the precession time- 
scale for a range of stellar companion masses as well as binary separa- 
tions. The results are summarized in Fig. 3. This calculation highlights 
an effective equivalence between distant massive perturbers and lower- 
mass perturbers with smaller semi-major axes, because the recession 
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Figure 2 | Excitation of disk-star misalignment. The time evolution of the 
misalignment angle i is shown. The nebula I consider, with disk mass 

Mdisk = 10° 7Mo ~10Mhjup, has a surface density profile of the form 2 « rl, 
extends between a;, = 1 astronomical units (AU) and doy = 50 AU, and orbits a 
host star with mass M, = 1 Mo. The plotted curves represent the dynamical 
states of the disk annuli, where every other ring is plotted. The grey curves 
depict the innermost annuli. The close proximity of the rings to each other 
demonstrates that the disk remains locally unwarped and acts as a rigid body to 
a good approximation. The extent of rigidity is largely controlled by the disk 
mass, with heavier disks remaining closer to their mid-plane. The host star is 
assumed to be a slow rotator and its spin pole is held fixed for simplicity. The 
perturbing M’ = 1 M, binary companion lies a’ = 500 Au away, has an 
eccentricity of e’ = 0.5, and is inclined with respect to the disk by i’ = 45° (red) 
and by i’ = 70° (blue). Throughout the duration of the integration, the annuli of 
the disk never attain significant eccentricity (e<0.1; see Supplementary 
Information for an in-depth discussion). This calculation was performed using 
a conservative softened Gaussian averaging model (with Ning, = 31; see 
Supplementary Information), and thus contains no restrictions on the secular 
dynamics of the system, but ignores dissipative forces of the gas. The nodal 
recession periods characteristic of this set-up are T isk ~0.9 Myr (i’ = 45°) and 
T disk“ 1.8 Myr (i’ = 70°). The results in this figure can be translated to 

other system parameters by noting that the maximum misalignment attainable 
by the disk is roughly twice the disk/binary orbit inclination and that the 
recession period scales approximately as T disk ca® / (M "cosi’ (I +3e7 / 2)) 
(see also Fig. 3). 


period of the disk scales approximately as T disk 0 a’ i M’, where a’ is the 
binary star’s semi-major axis and M’ is its mass. Thus, the precessional 
effect of a M’=1Mo star orbiting at a’ = 10° au is equivalent to the 
precessional effect that arises from the protoplanetary disk and its host 
star orbiting an approximately 10°Mo star cluster at a’ = 0.25 pc. I 
note that bound companions are not necessarily required for produc- 
tion of oblate disks, given that impulsive perturbations from passing 
stars in the birth cluster will cause the inclination of the disk to execute 
arandom walk and in some cases can excite significant misalignment”. 
Collectively, this explanation places the approximately 7° misalign- 
ment between the Sun’s spin axis and the Solar System’s invariable 
plane into a more general, extrasolar context. However, the process 
of planet formation in perturbed, possibly warped, disks certainly 
deserves further study. 

Given the diverse nature of the environments in which planetary 
systems may form, one would expect a wide range of characteristic 
precession timescales for proto-planetary disks. Although this does not 
necessarily imply an isotropic distribution of spin-orbit angles, it is 
quite possible that hot Jupiters that emerged from protoplanetary disks 
in multi-stellar systems already resided on misaligned orbits at the 
time of nebular dispersion. If this is true, small spin-orbit angles must 
be in large part either a result of adiabatic trailing of the host stars or 
dissipative re-alignment of the system. Such a scenario appears to be 
supported by observations: hot Jupiters orbiting hot, massive stars tend 
to be misaligned while hot Jupiters orbiting less-massive cooler stars 
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Figure 3 | Timescales for excitation of spin-orbit misalignment. The 
characteristic nodal recession period of a disk with mgisx = 107M. is shown as 
a function of binary mass and orbital properties. The disk considered has an 
outer edge at dour = 50 AU. Increasing do, will result in an approximately linear 
increase of the precession frequency. The period is expressed as a scaling law in 
the mass of the host star. In the region of parameter space where the precession 
period greatly exceeds the disk lifetime, only small misalignment angles 
between the disk and the host star can be excited. In principle, however, if the 
stellar companion does not get stripped away, the ascending node of the 
invariable plane of the formed planetary system can also recess. However, the 
degree to which this can affect a planet on a close-in orbit is sensitive to the 
particular architecture of the system. In the region of the parameter space where 
dynamics ceases to be adiabatic, misalignment is certainly attainable, but more 
quantitatively precise (magnetohydrodynamical) modelling is required for its 
characterization. Finally, at the extreme high-mass/small-orbital-separation 
end of parameter space, one could envision a scenario where a newly formed 
disk becomes severely twisted and eventually gets disrupted as a result of strong 
external perturbations. Driven by viscous dissipation, however, such a structure 
would probably re-collapse into a new protoplanetary disk, whose orbital plane 
will be close to the Laplace plane of the stellar binary orbit. 


tend to have small spin-orbit angles*. This transition has been attri- 
buted to tidal re-alignment of cooler stars owing to the increased size of 
their convective zones” and thus appears to be in good agreement with 
our model. 

The consistency arguments presented above indicate that disk- 
driven migration in binary systems is a favourable origin of misaligned 
hot Jupiters. This model can be tested as follows. Although multiple 
explanations exist for the origins of hot Jupiters, short-period multi- 
planet systems, which are typically less massive, have almost certainly 
undergone disk-driven migration. Theoretically, disk-driven migration 
tends to maintain coplanarity and near-resonances in the planetary 
systems”, both of which have been spectacularly confirmed by the 
Kepler mission*®. Within the context of the model proposed here, 
proto-planetary disks become misaligned with the spin axes of their 
host stars irrespectively of the masses and orbital radii of the newly 
formed planets. I predict that future observations of the Rossiter- 
McLaughlin effect will reveal that systems of close-in coplanar sub- 
giant planets can be misaligned with respect to the spin axes of their 
host stars in the absence of significant dissipative processes. Similarly, 
the presence of distant, nearly circular resonant planet pairs in systems 
that host hot Jupiters on oblique orbits would also point to disk-torque 
as the likely mechanism for the origin of spin-orbit misalignment. 
High-precision radial velocity monitoring of transiting systems should 
directly test this prediction in the near future. 
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Long-distance quantum teleportation and quantum repeater tech- 
nologies require entanglement between a single matter quantum bit 
(qubit) and a telecommunications (telecom)-wavelength photonic 
qubit'~. Electron spins in III-V semiconductor quantum dots are 
among the matter qubits that allow for the fastest spin manipula- 
tion®’ and photon emission*”, but entanglement between a single 
quantum-dot spin qubit and a flying (propagating) photonic qubit 
has yet to be demonstrated. Moreover, many quantum dots emit 
single photons at visible to near-infrared wavelengths, where silica 
fibre losses are so high that long-distance quantum communication 
protocols become difficult to implement'®. Here we demonstrate 
entanglement between an InAs quantum-dot electron spin qubit 
and a photonic qubit, by frequency downconversion of a spon- 
taneously emitted photon from a singly charged quantum dot to 
a wavelength of 1,560 nanometres. The use of sub-10-picosecond 
pulses at a wavelength of 2.2 micrometres in the frequency down- 
conversion process provides the necessary quantum erasure to 
eliminate which-path information in the photon energy. Together 
with previously demonstrated indistinguishable single-photon 
emission at high repetition rates'’”’, the present technique advances 
the III-V semiconductor quantum-dot spin system as a promising 
platform for long-distance quantum communication. 

A quantum communication network’ will consist of stationary 
matter qubits and flying photonic qubits. The two key technologies 
for quantum communication networks are quantum teleportation 
from a photonic qubit to a matter qubit or vice versa, and a quantum 
repeater to create and store the entangled states of remote matter 
qubits. These two core technologies rely on the ability to create entan- 
glement between a matter qubit and a photonic qubit over long dis- 
tances, followed by a Bell state measurement that transforms the matter 
qubit-photonic qubit entanglement into the teleportation of a given 
unknown quantum state, or into matter-matter entanglement’? "*. 
Matter—photon and matter-matter entanglement generation have been 
shown in several ionic*’*'” and atomic**'® systems, while matter- 
photon entanglement was recently observed in nitrogen-vacancy dia- 
mond colour centres*. However, these systems suffer from low photon 
collection efficiencies and relatively long optical recombination times, 
which can only partially be overcome by cavity-quantum electro- 
dynamic (cavity-QED) solutions’. In addition, none of them use tele- 
com-wavelength (1.5 1m) photons that would allow long-distance 
entanglement distribution. Combining a solid-state monolithic cavity- 
QED system with a fast quantum emitter’?! could solve the low 
photon yields. In the InAs quantum-dot system, subnanosecond 
optical recombination times have previously led to the demonstration 
of high single-photon rates when the quantum dots are embedded in 
high-quality optical cavities*”*°*’. Moreover, spin control of charged 
InAs quantum dots was shown to be feasible on picosecond timescales 


while preserving spin coherence*’”’, making it a promising candidate 


as a quantum network technology. However, spin-photon entangle- 
ment has yet to be established in this system, and its emission wave- 
length does not match the low-loss wavelength range of silica fibres. In 
this work, we address both these challenges with an ultrafast frequency 
downconversion technique based on a periodically poled lithium nio- 
bate (PPLN) waveguide device”. Its timing resolution enables demon- 
stration of entanglement between a single InAs quantum-dot spin 
qubit and a photonic qubit at 910nm, while the target wavelength 
(1,560 nm) would permit long-distance quantum communication. 

The physical system used in our experiment is presented in Fig. la—d. 
It consists of a single electron-doped InAs quantum dot, embedded ina 
low-Q microcavity (see also Methods). With an external magnetic field 
oriented in the Voigt geometry (perpendicular to the growth direction/ 
optical axis), two A-systems are formed, where an optically active, 
excited state is coupled to each of two ground states. The excited states 
are so-called trion states, as they consist of three particles: two electrons, 
paired into a singlet state, and an unpaired hole (|1 | {}) and |{ | {})). Both 
trion states are connected to each of the electron spin states, | |) and |{), 
ina A-configuration. These A-systems have been extensively studied”, 
and were previously used for initialization*® and coherent manipula- 
tion of the electron spin states®*’’’°. The optical selection rules are 
indicated in Fig. 1a, and were verified by polarization-selective photo- 
luminescence (Fig. 1b). In addition, initialization, rotation and measure- 
ment of the spin state of the quantum dot are realized, using all-optical 
techniques as reported previously®’ (see Fig. 1c, d and Methods). By 
selectively exciting one of the trion states, for example, the lt Lf})-state, 
spontaneous emission decay in a A-system leads to entanglement 
between the emitted photon and the electron spin**: 


_ il 


v2 


Here do refers to the difference in energy (colour) of the emitted 
photons, determined by the electron Zeeman energy, and H and V to 
their polarization (see Fig. 1c); w refers to the frequency (wavelength) 
of the V-polarized photon (910.10nm). Previous coherent spin 
manipulation experiments®” relied on a relatively large Zeeman split- 
ting dw between the respective electron spin states, both for high- 
fidelity spin initialization and readout, and for ultrafast coherent 
control of the spin through a combination of stimulated-Raman tran- 
sitions and Larmor precession. In the present experiment, this energy 
separation is set at about 2m X 17.6 GHz (external magnetic field 
B= 3T, Fig. 1d), leaving room for which-path information to leak 
out into the environment via the photon energy, which makes veri- 
fication of entanglement between the polarization of the photon and 
the electron spin challenging. Upon detection of an emitted photon in 
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Figure 1 | Level structure of quantum dot and 
spin manipulation. a, Level structure of an 
electron-doped quantum dot, with the magnetic 
field in-plane (Voigt geometry, boxed). V and H 
refer to linear polarizations, either perpendicular to 
(V) or parallel (H) to the magnetic field. Blue and 
orange arrows refer to the electron and (trion-) 
hole states, respectively. b, Verification (by 
polarization-selective magnetophotoluminescence) 
of the polarization selection rules of the studied 
electron-doped quantum dot. c, Level structure and 
optical spin manipulation scheme used. 6, Larmor 
precession frequency (2m X 17.6 GHz for B = 3 T); 
@, frequency of a V-polarized photon (wavelength 
910.10 nm for B = 3 T); c.w., narrowband, 
continuous wave laser used for initialization and 
readout; (.., effective spin Rabi frequency resulting 
from manipulation by detuned (4 = 300 GHz), 
circularly polarized optical pulses®’. d, Ramsey 
interference experiment, demonstrating coherent 
control of the electron spin qubit. Blue, raw data, for 
n/2-1/2 interference; green, least squares fit. For a 
3 T magnetic field, the Larmor precession frequency 
dw = 2m X 17.6 GHz. Inset, pulse scheme used. 
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the rotated polarization basis (say, | *)-photon) at time fy, the result- 
ing spin state evolves as follows: 


Manes (0) =e (o—I1) —1N)) (2) 


Owing to the large Zeeman splitting, a small uncertainty Af, in the 
photon’s arrival time can result in a large uncertainty in the phase of 
the spin state (6 X At,) and therefore a reduction in the entangle- 
ment visibility. Lowering the magnetic field further reduces the fidelity 
of the single-photon-based spin readout® and is therefore not a viable 
option in our experiment. Previous experiments in ionic* and NV- 
diamond systems’ relied on the speed (timing jitter) of commercial 
single-photon detectors to fix the phase of the coherent spin preces- 
sion, or, equivalently, to act as a quantum eraser”’ of the relatively small 
energy difference between different branches of the A-system (Sup- 
plementary Information). Such high bandwidth/fast detection tech- 
niques have also been employed to increase visibility in photon—photon 
interference experiments’. However, the much larger energy difference 
in our system requires a more advanced quantum eraser technique. 
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In order to tackle the large energy difference, we use a frequency 
downconversion technique that converts a single 910-nm photon into 
a single 1,560-nm photon with sub-8-ps timing resolution; this 
extreme timing resolution is sufficient to erase the which-path fre- 
quency information. The technique consists of mixing the single 
910-nm photon with a few-picosecond pump light pulse at 2.2 um 
in a PPLN waveguide device, producing a 1,560-nm photon via the 
process of difference-frequency generation (Methods). After narrow- 
band filtering at 1,560 nm, the frequency-downconverted photon can 
be detected on a superconducting nanowire single-photon detector” 
(SNSPD) conditional on the exact overlap of the single 910-nm photon 
and the 2.2-j1m pump pulse. The detection of the 1,560-nm photon 
therefore heralds a time-accurate measurement of the 910-nm 
photon: only if the 910-nm photon entered the PPLN waveguide 
exactly at the same time as the 2.2-11m pulse can a 1,560-nm photon 
be generated. 

The system diagram of the conversion set-up is indicated in Fig. 2a. 
Using cross-correlation with a bright, 3-ps 910-nm pulse (Fig. 2b), we 
can infer a timing resolution of 8 ps or better for the PPLN waveguide 
device (Methods). As 8 ps correspond to an effective bandwidth of more 


Figure 2 | Ultrafast conversion to 1,560 nm. 

a, Schematic overview of the conversion technique 
used (Methods). The 910-nm photon polarization 
is measured, with high timing accuracy, by a 
combination of a polarization analysing stage, a 
PPLN downconverter and an SNSPD. PBS, 
polarizing beamsplitter; HWP, half-wave plate; 
QWP, quarter-wave plate; SM, single-mode; BPF, 
1,560 nm bandpass filter. b, Timing resolution of 
the 1,560-nm conversion technique, measured by 
cross-correlating the 2.2-11m conversion pulse with 
a classical, 3-ps pulse at 910 nm (inset). From these 
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Figure 3 | Quantum-dot manipulation scheme for spin-photon 
entanglement verification. a, Schematic overview of the quantum-dot 
manipulation techniques used in the experiment. ‘100 ps’ indicates the 
resonant, 100-ps laser pulse used to excite the system into the | | | {})-state;‘c.w.’ 
indicates the continuous wave (c.w.) laser used for initialization into the |{)- 
state, and for readout of the | |)-state. do, Spin Larmor precession frequency; 
Q.65 effective spin Rabi frequency, resulting from manipulation by 3-ps, 
detuned optical pulses®’. For the classical correlation measurements, 
manipulation of the spin using the H-branch was used as well, whereas the 
entanglement result was obtained using only V-branch pumping. b, Timing 


than 100 GHz, this is more than sufficient to erase the information 
inherent in the Zeeman energy of 17.6 GHz at 3 T (Supplementary 
Information). In addition, the conversion and filtering technique results 
in almost noise-free signal transduction to the telecom band, as shown 
in Fig. 2c. We measure the polarization state of 910-nm photons using a 
polarization analysing stage consisting of a quarter-wave plate and a 
half-wave plate and a polarizer, followed by a single downconversion 
set-up that provides the necessary timing accuracy (filtering) in order 
to verify entanglement at 910 nm. The polarization selective operation 
of PPLN waveguides does not permit a full photonic polarization qubit 
at 910nm to be directly downconverted to a polarization qubit at 
1,560 nm. However, and as a straightforward extension of the work 
presented in this Letter, mapping the polarization qubit into a dual-rail 
qubit”’, using two separate time-resolved downconverters, would realize 
a spin-entangled 1,560-nm qubit, suitable for long-distance quantum 
communication” (Supplementary Information). 

We analyse spin—photon entanglement using a combination of the 
time-resolved conversion technique and previously established spin 
initialization, manipulation and readout techniques®’. Using 3-ps- 
long, 1-nm red-detuned optical pulses in a Ramsey interferometry 
set-up, we can completely control the state of any arbitrary electron 
spin superposition, with fidelities around 95% for the particular 
quantum dot used in this work (Supplementary Information). The spin 
state is measured using an optical pumping scheme®”’, which is also used 
to initialize the system into the |1)-state. Both processes emit a single 
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diagram of the pulse sequence used in the experiment. Note the three cycles 
used in the experiment: initialization through optical pumping and spin 
rotation, generation of entanglement using 100-ps laser pulses and 
spontaneous emission decay, followed by ultrafast photon conversion to a 
telecom wavelength, and spin measurement through a combination of spin 
rotation (measurement basis change), optical pumping and single-photon 
detection. The cycle time of a single shot of the experiment was chosen to be 
either 39 or 52 ns. c, Schematic diagram of the set-up used in the experiment. 
EOM, electro-optic modulator; NPBS, non-polarizing beamsplitter. See text for 
other components. 


910-nm H-polarized photon that can be detected after frequency and 
polarization filtering from the optical pumping laser*’ (Fig. 3a and 
Methods). For a 13-ns spin interrogation time, readout and initializa- 
tion fidelities of =96% can be inferred from the time-resolved emission 
decay, mainly limited by residual leakage of the continuous-wave laser 
used for optical pumping (Supplementary Information). The full con- 
trol sequence of the optical pulses is indicated in Fig. 3b, and consists of 
three stages. First, the system is initialized into either the | |)-state or the 
| })-state by a combination of optical pumping and spin rotation with a 
m-pulse. Then, a 100-ps optical m-pulse resonant with the ||)-|T | {})- 
transition (or |{)-|T|{})-transition) excites to the trion state, after 
which spontaneous emission occurs into the electron spin states. The 
difference between the spontaneous emission decay time (600 ps) and 
the excitation pulse duration (100 ps) allows for temporal filtering of 
the spontaneously emitted single photons. In addition, for the classical 
correlation measurements (see below), cross-polarization filtering can 
be used to further suppress noise from the reflection of the 100-ps 
excitation laser. The spontaneously emitted single photons are then 
sent to a polarization analysing stage, after which they are converted to 
1,560 nm using the PPLN waveguide, and detected by an SNSPD. The 
next stage of the experiment consists of the spin analysis, which is 
performed through a combination of spin rotations (these implement 
an effective measurement basis change) and the optical pumping/read- 
out cycle. We measure the correlations between the polarization of the 
910-nm photon and the spin state through a timing histogram analysis 
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of the photons detected in the conversion and measurement cycles 
(Fig. 3c and Methods). 

Using our time-resolved downconversion technique, we measure 
the classical (computational basis) correlations between the spin along 
the magnetic field axis (z), and the H-V polarization of the photon 
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Figure 4 | Spin-photon entanglement verification. a, Classical 
(computational basis) spin-photon correlations, measured through 
downconversion to 1,560 nm. The black arrows (see also b, d) refer to the spin 
orientation in the computational (vertical arrows) or rotated (horizontal 
arrows) basis. b, Classical (computational basis) spin-photon correlations, 
measured at 910 nm, without downconversion. ¢, Spin-photon entanglement: 
on time-resolved detection of a «**” -downconverted photon, the electron spin 
starts precessing due to the electron Zeeman energy (Larmor precession). By 
changing the arrival time of the subsequent 1/2-pulse, this coherent spin 
precession can be mapped into a Ramsey fringe, where the population in the 
| |)-state oscillates as a function of time, and depends on the helicity of the 
downconverted photon (the Ramsey fringes for opposite helicities are in anti- 
phase). This oscillatory spin signal then results in an oscillatory, periodic 
coincidence count rate between the downconverted single photon and the spin- 
measurement photon. Blue, raw data from histogram analysis; green, least- 
squares, sinusoidal fit to the data. Inset, relative timings used. 2.2 um indicates 
the 2.2-1m conversion pump pulse; 1/2 shows the 1/2 spin rotation pulse; c.w. 
indicates the continuous wave (c.w.), spin measurement laser at 910 nm. 

d, Spin-photon correlations in the rotated bases, demonstrating entanglement. 
Error bars, +1 s.d. (a-d), due to Poissonian statistics (Meyents ~ 50-100). 
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(Fig. 4a). For an H-polarized photon, we measure an excellent correla- 
tion with the |{)-spin-state in the subsequent spin measurement cycle. 
Likewise, for a V-polarized photon, we see a strong correlation with the 
||)-spin-state. We can directly compare these correlations with the 
ones obtained without ultrafast downconversion, using a commercial 
single-photon detector (Fig. 4b). The results are in good agreement, 
and differ only due to small amounts of residual noise in the down- 
conversion process (Supplementary Information). 

Verification of entanglement requires observation of correlations in 
a rotated basis of the photon polarization and the spin as well. When 
we measure the photon in the circular polarization basis (a), lo )), 


we measure the spin in the basis of I>) = eID +1) and 
1 
|-)= we (\t) —|1)). After detection ofa|a") (\o” ))-downconverted 


photon at time t), the spin is projected into the |<)-state (|>)), which 
subsequently evolves in time due to Larmor precession: 


Wann(t)) = Se (20-11) F1N)) (3) 


Here dw corresponds to the Zeeman frequency of 2m X 17.6 GHz. 
By scanning the arrival time of a 7/2 spin rotation pulse in a Ramsey 
interferometer®’ (Fig. 4c and Methods) we can trace out this coherent 
oscillation, and verify entanglement. From the minima and maxima of 
these coherent oscillations, we can derive, as in Fig. 4d, the photon- 
spin correlations for |¢") (|o )) downconversion. For a particular 
photon polarization (\*), |o” )) and arrival time of the 7/2 spin rota- 
tion pulse, a correlation can be measured, as expected. Subsequently 
changing the arrival time of the /2-pulse by half a Larmor period 
results in an anticorrelation (see also Fig. 4c). Together, these results 
demonstrate spin-photon entanglement. We estimate the entangle- 
ment fidelity** to be 0.8+0.085 (Methods and Supplementary 
Information), which exceeds the classical limit of 0.5 by more than 
three standard deviations**. 

Whereas our entanglement verification technique relies on time- 
resolved frequency conversion, resulting in filtering out only those 
910-nm photons that exactly overlap with the arrival time of the 
2.2-L4m pump pulse, a more generic downconversion technique with 
a continuous wave pump laser at 2.2 um and coincidence detection of 
two photons from separate quantum-dot sources could be used in 
future experiments in order to obtain spin-spin entanglement” (Sup- 
plementary Information). 

We have demonstrated high-fidelity spin-photon entanglement in 
a single InAs quantum dot using an ultrafast downconversion tech- 
nique to the lowest-loss telecom wavelength (1,560 nm). When com- 
bined with ultrafast control of InAs quantum-dot electron spins®’ and 
fast radiative decay of indistinguishable single photons from such 
quantum dots embedded in optical cavities'’’*”°*', our results should 
enable quantum state transfer from a flying qubit to a stationary 
qubit or vice versa, and entanglement distribution between two remote 
matter qubits in long-distance quantum networks. 

We note that, contemporaneously with this work, another group 
verified spin-energy entanglement between an InAs quantum-dot 
electron spin and a spontaneously emitted photon at 966 nm (ref. 30). 


METHODS SUMMARY 


Quantum dot and optical control. All results are obtained from a single quantum 
dot, emitting at 910 nm, embedded in a planar microcavity’. An external magnetic 
field in Voigt geometry splits the electron spin and trion states (Fig. 1a). A 0.68 NA 
aspheric lens focuses the pump and rotation lasers onto the sample. The coherent 
manipulation techniques are identical to those reported previously’, with fidelities 
(initialization, readout, coherent rotation) around 95% or higher (Supplementary 
Information). After initialization, the | | {})-state is excited by a 100-ps mode- 
locked-laser pulse, synchronized with the spin-rotation laser. Photoluminescence 
is collected in a confocal set-up, and split into two branches. One branch is cross- 
polarized and sent through a double-monochromator onto a single-photon counter 
(SPCM) for spin-state analysis. The other branch is sent to a polarization analysing 
stage, after which it is sent to the downconversion set-up. 
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Downconversion, data acquisition and processing. The 2.2-j1m light pulses 
needed for conversion are generated in a PPLN chip by mixing 3-ps, 911-nm 
pulses from the spin-rotation laser with narrowband, continuous wave 1,560- 
nm light. A PPLN waveguide converts 910-nm photons into 1,560-nm photons, 
conditional on overlap with the 2.2-j1m pulses. An SNSPD subsequently detects 
the 1,560-nm photons. The SPCM and SNSPD signals are combined on a timing 
analyser, which allows for signal gating in post-processing. The noise in the con- 
version process is well below the single-photon level (between 4:1 and 10:1 signal- 
to-noise ratio). The correlation data are obtained from the coincidences between 
the downconverted single photons and the single photons used for spin detection, 
through post-processing of the data stream from the timing analyser, and normal- 
ized to uncorrelated events (Supplementary Information). The entanglement 
fidelity analysis follows the same procedure used in ion-trap* and NV-diamond? 
spin-photon entanglement experiments, and is estimated to be around 0.8 + 0.085, 
well above the classical limit of 0.5. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Device design. The quantum-dot sample used is similar to the ones used prev- 
iously to investigate ultrafast optical control and single spin echoes’. Compared to 
those experiments, the dot density was reduced, and the wavelength used was blue- 
shifted (910nm). An asymmetric, low-Q cavity (10nm FWHM, centred at 
910nm) redirects the spontaneous emission preferentially in one direction, 
increasing collection efficiency and reducing required peak powers for coherent 
control’. An external magnetic field (Voigt geometry, perpendicular to the optical 
axis) splits both the electron spin states and the excited, trion states. The resulting 
A-systems are indicated in Fig. 1a, and the polarization selection rules are verified 
by means of polarization-selective photoluminescence (half-wave plate and polar- 
izing beamsplitter, Fig. 1b). 

Spin control and single-photon collection. The quantum-dot device is cooled to 
1.6 K inside a superconducting magnetic cryostat (Oxford Spectromag; the mag- 
netic field used varies between B = 3 and 6 T). A 0.68 NA aspheric lens inside the 
cryostat focuses the pump and rotation lasers onto the sample, which is scanned 
relative to the lens by means of slip-stick piezo-electric positioners (Attocube 
Systems). The coherent manipulation techniques used are identical to those 
reported previously’. A narrowband c.w. laser (New Focus Velocity) is used for 
spin initialization and readout, resonant to the | | )-|{ | })-transition (910.10 nm for 
a3T magnetic field), and switched on and off by a fibre-based EOM (EOSpace). 
The interrogation time is 13 ns, with an initialization and readout fidelity of about 
96% or higher (Supplementary Information). Coherent spin rotations are per- 
formed in a Ramsey-interferometry set-up, using pulses from a mode-locked laser 
(3-ps pulse duration, centre wavelength 911 nm, Spectra-Physics Tsunami), which 
are delayed relative to each other through a retroreflector on a motorized stage. 
Individual pulses are selected through free-space EOMs (Conoptics), which are 
double-passed in order to increase the extinction ratio. The selective excitation of 
the | | })-state before spontaneous emission is realized through a combination of 
optical pumping into the | |)-state, rotation by an optical -pulse into the | |)-state, 
followed by the application of a 100-ps pulse (optical m-pulse) from another mode- 
locked laser (Spectra-Physics Tsunami), resonant with the ||)-|{ | {})-transition. 
This laser is synchronized with the rotation laser (Spectra-Physics Lok-to-Clock 
system). Accurate polarization control limits the probability of excitation into 
the |{ | {})-state to less than 1%. Another fibre-optic modulator allows for pulse- 
picking of the 100-ps excitation pulses. The single-photon photoluminescence is 
collected in a confocal set-up, and split into two branches by a non-polarizing 
beamsplitter. One branch is cross-polarized with respect to the initialization and 
optical pumping lasers, and sent through a double-monochromator onto a single- 
photon counter for spin-state analysis (Perkin-Elmer SPCM; 20% quantum 
efficiency, 170 Hz ungated dark count rate). The other branch is sent to a polari- 
zation analysing stage (quarter- and half-wave plate and polarizer), after which it is 
coupled into single-mode fibres and sent to the downconversion set-up. The 
polarization analysing stage is carefully calibrated, in order to account for residual 
birefringence in the setup. All EOMs are controlled by mutually synchronized 
pulse-pattern generators (76 MHz Tektronix and 10 GHz Anritsu PPG), that are 
themselves synchronized to the repetition rate of the mode-locked lasers. In con- 
trast to previous experiments, no software-based lock-in technique was used. 
Instead, spatial, polarization, wavelength and time-filtering are used to separate 
reflected light from the single photons. 

Downconversion set-up. The 2.2-11m light pulses needed for conversion are 
generated by a difference-frequency generation (DFG) process that mixes the 
3-ps, 911-nm pulses from the mode-locked laser with narrowband, c.w. 1,560-nm 
light in an MgO-doped, PPLN chip. The 1,560-nm light is modulated by a fibre-optic 


modulator, and amplified by erbium-doped fibre amplifiers. After wave mixing, 
the residual 1,560-nm and 911-nm light is filtered out through a combination of 
dichroic and absorptive filters. The resulting pulse width depends on the exact 
power and wavelength used for the DFG process, but is measured to be between 
3 and 8 ps. 

A PPLN waveguide efficiently converts 910-nm, spontaneously emitted 

photons to 1,560-nm photons, conditional on overlap with the 2.2-ym pulses. 
Residual scattered light from the 910-nm and 2.2-41m branches is eliminated 
through a fibre-Bragg grating and a long-pass filter. The 1,560-nm photons are 
subsequently detected on an SNSPD, maintained at 2 K, with 14% system detec- 
tion efficiency, 40-Hz ungated dark count rate and 100-ps FWHM timing jitter. 
Timing analysis is performed on a timing analyser (PicoQuant Hydraharp), used 
in time-tagged time-resolved (TTTR) mode, which allows for accurate gating of 
the signals of both the SPCM and the SNSPD in post-processing, thereby dras- 
tically reducing the effects of dark counts. The overall signal-to-noise ratio is 
limited by residual leakage of the 2.2-\1m light, dark counts within the timing 
response of the SNSPD, and residual reflected light from the 100-ps excitation 
pulses, but is in general well below the single-photon level (signal-to-noise ratio 
ranging between 4:1 and 10:1 for the experiments described in this work). The 
timing of the 2.2-,1m light is chosen such that the subsequent 1/2-pulse arrives well 
within the T; -dephasing time of the quantum dot; similar results can be obtained 
for times up to microseconds by implementing a spin echo technique to overcome 
shot-to-shot dephasing’. 
Data analysis and entanglement fidelity. The correlation data obtained in this 
work are the result of a histogram analysis, performed on the coincidence count 
rate between the downconverted single photons, and the single photons used for 
spin detection (Supplementary Information). The coincidence count rate is 
obtained through post-processing of the TTTR data stream, and comparing the 
coincidences within the same experimental run to those in subsequent, uncor- 
related ones. The repetition rate is set at 39 or 52 ns, and the 0.1% single photon 
efficiency and time-gated frequency downconversion result in a 1,560-nm single- 
photon detection and entanglement generation rate of some 2-5 Hz. In combina- 
tion with another 0.1% single-photon detection efficiency in order to detect the 
spin state, this results in an average coincidence rate of some 2-5mHz. We 
emphasize that these losses are predominantly due to the inefficiency of extracting 
a single photon from the quantum dot, which can be significantly improved by 
accurate cavity design*. The conversion process in itself, while lossy due to the 
aggressive time-filtering in order to obtain good timing resolution, is rather effec- 
tive, with internal quantum efficiencies estimated above 80%, and filtering losses of 
several dB maximum. 

The entanglement fidelity analysis follows the same procedure used in the 
ion-trap* and NV-diamond’ spin-photon entanglement experiments: 


F>F,+F,, 
1 

F,= 3 (part +Pyivy —2,\/PuaPri)> 
iL 


P= 5 (Pot cto —Pot—04-+ + Po-+,0--+ — Po-eo-+)- 


Here py7,4; etc. refer to elements of the spin photon density matrix, which can 
be associated with the observed correlations in our experiment. For the values 
obtained in our work, F, = 0.44 + 0.06, F, = 0.36 + 0.06, from which we obtain a 
bound on the quoted fidelity: F = 0.8 + 0.085. 
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Observation of entanglement between a quantum 
dot spin and a single photon 


W. B. Gao’, P. Fallahi', E. Togan!, J. Miguel-Sanchez! & A. Imamoglu! 


Entanglement has a central role in fundamental tests of quantum 
mechanics’ as well as in the burgeoning field of quantum informa- 
tion processing. Particularly in the context of quantum networks 
and communication, a main challenge is the efficient generation of 
entanglement between stationary (spin) and propagating (photon) 
quantum bits”. Here we report the observation of quantum entan- 
glement between a semiconductor quantum dot spin and the 
colour of a propagating optical photon. The demonstration of 
entanglement relies on the use of fast, single-photon detection, 
which allows us to project the photon into a superposition of red 
and blue frequency components. Our results extend the previous 
demonstrations of single-spin/single-photon entanglement in 
trapped ions’, neutral atoms** and nitrogen-vacancy centres® to 
the domain of artificial atoms in semiconductor nanostructures 
that allow for on-chip integration of electronic and photonic 
elements”*. As a result of its fast optical transitions and favourable 
selection rules, the scheme we implement could in principle gene- 
rate nearly deterministic entangled spin-photon pairs at a rate 
determined ultimately by the high spontaneous emission rate. 
Our observation constitutes a first step towards implementation 
of a quantum network with nodes’ consisting of semiconductor 
spin quantum bits’®’. 

Recent advances have demonstrated the potential of semiconductor 
spin quantum bits (qubits) in the implementation of fast, single- 
qubit'?’* and two-qubit’’ quantum gates. Realization of a large-scale, 
spin-based quantum information processor, however, remains an out- 
standing challenge owing to limitations on the coherent coupling of 
distant qubits. An alternative route is provided by the concept of 
distributed quantum computation, which involves small-scale, few- 
qubit quantum processor nodes interconnected by single photons that 
are quantum correlated with spin qubits'*. Remarkably, photonic circuits 
allowing for efficient optical-photon-based quantum communication 
between the nodes can be fabricated on the same semiconductor chip’. 
For this distributed approach to solid-state quantum information 
processing, as well as for long-distance quantum communication, 
generation of the spin—-photon entanglement that we demonstrate here 
emerges as a key enabling technology. 

Our experiments utilize a single-electron-charged, self-assembled 
InGaAs quantum dot. Figure 1a shows the relevant energy-level dia- 
gram as well as the allowed optical transitions in Voigt geometry, 
where an external magnetic field (B,.) is applied perpendicular to the 
growth direction’. In our scheme, it is possible to generate and verify 
high-fidelity entanglement for 0.2T=B,=1T: the lower limit 
ensures that the quantum dot is optically excited only to the red trion 
state, |T,) (Fig. 1a), and the upper limit is dictated by the detector time 
resolution. In our experiments, we choose B, =0.7 T. All four transi- 
tions between the two ground states and the two excited states (Fig. 1a) 
have identical oscillator strengths and are linearly polarized either 
parallel (H) or perpendicular (V) to the applied field direction’’. The 
ground states of the quantum dot are identified by the orientation of 
the electron spin, with |t) and ||) respectively denoting spins parallel 
and antiparallel to the magnetic field direction. Spontaneous emission 


of an H- or V-polarized photon at frequency @,.q or, respectively, Mpiue 
from the trion state |T,) at rate °/2 brings the quantum dot back into 
the state ||) or, respectively, |). In addition to resonant laser excitation 
of the trion states, our scheme uses non-resonant, right-hand-circularly 
(o* )-polarized, 4-ps-long mode-locked laser pulses that are red-detuned 
from the trion transitions by ~210 GHz: these pulses induce a coherent 
rotation of the electron spin, with minimal trion excitation’’. We 
depict these pulses in Fig. 1a with a time-dependent effective magnetic 
field along the z direction (Supplementary Information). 

The basic principle behind the deterministic generation of a spin- 
photon entangled state is straightforward: following the excitation of 
the quantum dot into the trion state, the radiative recombination projects 
the coupled quantum dot/photonic mode system into the entangled state 


|) = a ((J) oi) +i) }OnesV) (1) 
written in the computational basis, where |@yea;H) and |@piue; V) 
denote propagating single-photon-pulse state vectors with respective 
centre frequencies M;eq and Mpiue and respective polarizations H and 
V. The electronic Zeeman energy satisfies Oz = Wpiue — Ored- 

We use resonance fluorescence measurements” to map the 
optical transitions of the quantum dot. Observation of resonance 
fluorescence from the quantum dot transitions in Voigt geometry 
requires the presence of a second laser field that rotates the electron 
spin, because otherwise spin pumping, which takes place within a few 
nanoseconds”, ensures that the quantum dot remains dark to optical 
excitation. To characterize the quantum dot, we use a pulse sequence 
where two resonant laser pulses, respectively 5 and 1.2ns long, 
are applied in addition to a 4-ps pulse implementing spin rotation 
(Fig. 1b). The resonance fluorescence spectrum that we obtain as we 
scan the resonant laser across the four allowed quantum dot transi- 
tions is depicted in Fig. 1c. Our fast, single-photon detection capability 
allows us to monitor the time dependence of the resonance fluo- 
rescence: as Fig. 1d shows, we can easily distinguish photons that 
are emitted during excitation by the resonant laser both from those 
emitted as a result of (unwanted) trion excitation induced by the 
m-pulse and the background photons from the 4-ps laser. This ability 
to monitor photons generated in definite time windows is crucial for 
the verification of spin-photon entanglement. The experimental set- 
up that allows us to implement two-laser excitation as well as selective 
detection of resonance fluorescence generated in different time windows 
or at different frequencies, or both, is depicted in Fig. le (Methods 
Summary). Suppression of the + -polarized background laser light is 
ensured by a polarizer” that projects the polarization of the trans- 
mitted resonance fluorescence photons to a (left-hand circular). 
Because the spin polarization correlations are erased at this stage, 
entanglement is between the electron spin orientation and the centre 
frequency of the single-photon pulse (such that the photonic states can 
be written as |@;ea) and |@ptue)). 

To demonstrate spin-photon entanglement, we first prepare the 
quantum dot in state |}) with a probability exceeding 87% by applying 
a 5-ns resonant laser field tuned to the | |)<>|T,) transition; we refer to 
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Figure 1 | Quantum dot transitions and experimental set-up. a, The energy- 
level diagram of a single-electron-charged InGaAs quantum dot in Voigt 
geometry. Spontaneous emission following resonant optical excitation of the 
trion state, |T,), with Rabi frequency Q;.; leads to the generation of an 
entangled spin—-photon state. Non-resonant, o* -polarized, 4-ps laser pulses 
implement coherent spin rotation; the effect of sucha laser pulse is equivalent to 
a time-dependent magnetic field, B, .¢¢, that is perpendicular to the external 
field B, (Supplementary Information). b, The pulse sequence used to 
characterize the quantum dot transitions: following a 5-ns resonant laser 
excitation, a 4-ps m-pulse is applied. A 1.2-ns resonant laser excitation 


this laser as the measurement/preparation pulse. We then transfer the 
quantum dot to the ||) state using a 4-ps m-pulse’’. An entangled spin- 
photon pair is then generated by exciting |T,) with a 1.2-ns resonant 
laser pulse, which we refer to as the entanglement pulse; the corres- 
ponding pulse sequence is depicted in Fig. 2a. The whole pulse sequence 
is repeated after 13ns, with the 5-ns resonant laser implementing 
spin measurement for the preceding pulse sequence as well as ensuring 
spin pumping/preparation in the |f) state for the subsequent cycle 
(Fig. 2a). The fact that the average number of photons emitted during 
this spin measurement/pumping time window is ~ 1, together with our 
relatively modest combined collection and detection efficiency of 
~0.1%, implies that we extract heralded spin information”; that is, 
the unlikely event of a photon being detected at the avalanche photo- 
diode (APD) tells us with very high confidence level that the spin 
(before the measurement) is ||), whereas the absence of a detection 
event yields no information. An additional m-pulse (Fig. 2a, dashed 
curve) applied 3 ns after the onset of the entanglement pulse enables 
a photon detection event at the APD during the measurement/ 
preparation pulse to project the spin into |?) rather than ||). 

Figure 2b shows the time-resolved resonance fluorescence at fre- 
quencies Mpiue and Wyeq. The depicted resonance fluorescence starts at 
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completes the pulse sequence, which has a repetition period of 13 ns. a.u., 
arbitrary units. c, Resonance fluorescence spectrum generated by the pulse 
sequence depicted in b as the laser frequency wr, is scanned across the quantum 
dot transitions at B, =0.7 T. d, Time dependence of resonance fluorescence 
obtained when the laser is on resonance with the ||)<>|T,) transition 

(Or = @red); indicated by the red arrow in c. The spike that coincides with the 
4-ps laser pulse is due to incomplete suppression of reflected laser photons. 

e, The schematic of the experimental set-up highlighting the important optical 
elements. BS, beam splitter; CW, continuous wave; EOM, electro-optic 
modulator; FP, Fabry—Pérot; POL, polarizer; QWP, quarter-wave plate. 


the onset of the entanglement pulse, which we take to be t = 0 in Figs 2 
and 3. For all experiments depicted in Fig. 2, we use a Fabry—Pérot 
interferometer to select out photons either at Mpiue OF Mreq before 
sending them to the superconducting single-photon detector (SSPD). 
The (unconditional) resonance fluorescence counts at the two frequen- 
cies are nearly identical in intensity, indicating that the two decay 
channels have equal oscillator strengths. The shape of these single- 
photon pulses is determined by the time dependence of the trion popu- 
lation, which in turn is determined by the excitation laser (through its 
pulse shape and the Rabi frequency, 92;.;) and the spontaneous emis- 
sion time [’~' =650 ps. Figure 2c (green bars) shows the normalized, 
zero-time-delay photon autocorrelation function, g)( 0)=0.02+0.01, 
for photons emitted during the time interval [1.2 ns, 1.64 ns] (Fig. 2b, 
dashed green box). This measurement demonstrates that the unlikely 
two-photon emission events lead to a small degradation of the entan- 
glement. When we measure photon correlations for the whole single- 
photon pulse, we find that g°)(0)=0.16+0.01 (Fig. 2c, black bars). 
This value is limited partly by a residual contribution from the reflected 
laser photons and partly by multiple excitation of the quantum dot 
during the entanglement pulse. We note that verifying the spin-photon 
entanglement in the time interval [1.2 ns, 1.64ns] also allows us to 
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Figure 2 | Measurement of classical spin-photon correlations. a, The pulse 
sequence used to measure the spin—-photon correlations in the computational 
(non-rotated) basis. A 5-ns laser excitation resonant with the | |)<>|T;) 
transition implements spin pumping/preparation in |f); detection of a 
resonance fluorescence event during this measurement/preparation pulse 
projects the spin state to | |). Following a 4-ps-long m-pulse, a 1.2-ns laser, 
resonant with the | |)<>|T;) transition, initiates the generation of an entangled 
spin-photon pair. An additional m-pulse (dashed curve) is applied only to 
obtain the data shown in e. b, Time-resolved resonance fluorescence obtained 
at Myea (red) and Wpiue (blue) using the pulse sequence shown in a: identical 
intensities and pulse shapes at the two frequencies indicate that the detected 
events stem from a single trion state (|T,)) and that the oscillator strength of the 
transitions at @,.q and Mpiye are identical. c, Photon correlation measurement 
carried out on resonance fluorescence generated in a 3-ns time window starting 
at the onset of the entanglement pulse (¢ = 0). The observed g?)(0) =0.16 
(black) is limited by background laser photons. When we carry out photon 
correlation measurements on photons detected during the time interval [1.2 ns, 
1.64 ns] (box in b), we find that g)(0) =0.02 (green). d, Time-dependent 
resonance fluorescence at @,eq (red) and Mpiye (blue), conditioned on detection 
ofa photon during the subsequent measurement/preparation pulse. The strong 
suppression of conditional photon detection events at @pjue demonstrates the 
strong classical correlation between spin measurement yielding ||) and a 
photon detection at yea. e, Same as in d but now with a 7-pulse applied before 
the measurement/preparation pulse, showing strong correlations between spin 
detection in |f) and photon detection at Wpiue- 


avoid the effects of the fluctuating nuclear spin environment on the 
entanglement fidelity. 

Demonstration of spin—photon correlations in the computational/ 
non-rotated basis requires the measurement of classical correlation 
between the spin direction and the photon frequency. This is achieved 
by carrying out a coincidence measurement between resonance fluo- 
rescence photons at either Wpiue OF Wrea Generated immediately after 
the entanglement pulse and those generated during the measurement/ 
preparation pulse. Conditional on the detection of the spin state | |), we 
find that the probability of detecting a blue (@piue) photon at the SSPD 
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is drastically suppressed compared with the probability of detecting a 
red (@;eq) photon (Fig. 2d). The ratio of the red to blue photons in the 
time interval [1.2 ns, 1.64ns] is ~21:3. To quantify completely the 
classical correlations in the computational basis, we introduce a 4-ps 
m-pulse at t= 3ns that makes it possible to condition the single- 
photon colour measurements on detection of the spin in |f). In this 
case, we find that the red-photon detection events are strongly sup- 
pressed (Fig. 2e). The ratio of the blue to red photons in this case, 
however, is only ~10:3, owing to (unwanted) trion excitation by the 
4-ps laser that leads to lower fidelity in the subsequent spin measure- 
ment. From these two measurements, we determine the fidelity of 
the classical correlations to be F,=0.87+0.09 (Supplementary 
Information). 

To verify that the generated spin—photon state is entangled, we next 
show that the correlations between spin orientation and photon colour 
exist in a rotated basis®. In the rotated basis, the relative phase between 
the two components of the entangled state in equation (1) is crucial. 
After generation at time f,, the two components acquire a time- 
dependent phase that stems from the different propagation phase 
factors of the blue and red frequency components of the photonic 
mode. In our experiments, we infer the generation time from the 
detection time, fg, of the single photon at the SSPD, through the rela- 
tion tg = ta —L/c, where L denotes the distance from the quantum dot 
to the detector and c is the speed of light. Hence, given a photon 
detection event at tg, we can a-posteriori write the time dependence 
of the spin-photon entangled state of equation (1) as 


|'(t)) = S (|L) |@rea; He 2°) + iT) |otue; V)) 

The requisite spin measurement is implemented by rotating the 
quantum dot electron spin with either a 1/2-pulse or a 31/2-pulse at 
t=t,=1.7 ns. A photon detection event at the APD during the fol- 
lowing measurement/preparation pulse then projects the electron spin 
into (|!) —i|t))/V2 (/2-pulse) or (||) +i]t))/V2 (32/2-pulse). The 
photonic wavefunction consistent with the spin measurement after the 
m/2-pulse can be written as (|@rea; Hye 24's) — | apie; V)) /V2 
(Supplementary Information). After passing through the polarizer, 
which fixes the polarization of the photon to be (|H) —i|V)) / V2, this 
wavefunction is 


1 


|) = ea (|@reae 12 —'®) — i] Dpiue) ) (2) 


In the experiment, we remove the Fabry—Pérot interferometer and 
directly send the single-photon pulses onto a fast SSPD” with a jitter of 
At=60 ps. With a SSPD that had jitter smaller than 1/az, we could 
determine t, well enough to resolve the time dependence in equa- 
tion (2), which in turn would indicate that the photonic state is pro- 
jected into a superposition of its two frequency components. The 
resulting coincidence probability is P~[1+ sin (@z(t; —tg))|/2. As a 
consequence of the time-dependent relative phase, two single-photon 
states generated at different times, t? and t>, such that w7( te - t) =T, 
allow us to infer the overlap of the photonic state with two orthogo- 
nal states. Random photon detection times together with the fact that 
the single-photon pulse width, t, satisfies wzt > 1, allow us effectively 
to implement projective measurements into orthogonal states such as 
|@rea) at |@piue) : 

Figure 3b shows the coincidence between the single-photon detec- 
tion events induced by the entanglement pulse and the detection 
of a photon during the measurement/preparation pulse, following a 
m/2-pulse: the oscillations with a period given by 2z/mz = 238 ps in 
the conditioned single-photon detection events (Fig. 3b, red squares) 
are in stark contrast to the unconditioned single-photon detection 
events shown in Fig. 2b. This particular time dependence, stemming 
from the conditioned photonic wavefunction given in equation (2), 
constitutes a remarkable manifestation of the quantum coherence of 
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Figure 3 | Demonstration of quantum correlation between the electron spin 
and the emitted single-photon pulse. a, The pulse sequence used to measure 
the spin-photon correlations in the rotated basis: the only difference from 
Fig. 2a is the introduction of a 1/2- or 3n/2-pulse, applied at t = 1.7 ns, that 
rotates the spin measurement basis such that the detection of a photon during 
the subsequent measurement/preparation pulse projects the spin to ||) —i|) 
or, respectively,| |) + i|). The shading indicates the [1.2 ns, 1.64 ns] 
entanglement analysis time window. b, Time-resolved coincidence events 
between the single-photon detection following the entanglement pulse and the 
detection of a photon during the first measurement/preparation pulse 
following a m/2-pulse (red squares). For comparison, an average of coincidence 
events between the spin and photon detection events taking place in 25 different 
excitation/preparation cycles is also shown (black squares). The red curve is a fit 
to the experimental data, and the black curve is the rescaled single-photon pulse 
shown in Fig. 2b. c, Normalized coincidence during the [1.2 ns, 1.64 ns] window 
(box in b); normalization is carried out by dividing the number of counts from 
correlated spin-photon pairs by twice the number of uncorrelated events in the 
corresponding time bin. d, Same as in b but now with a 31/2-pulse. e, Same as in 
c but now with a 31/2-pulse. All error bars represent 1 s.d., deduced from 
Poissonian statistics of the raw detection events. 


the entangled spin—photon system. For comparison, we also show an 
average of coincidence events between the spin and photon detection 
events taking place in different excitation/preparation cycles (Fig. 3b, 
black squares). As expected, these coincidences do not show any oscil- 
lations, but they help us quantify the degree of quantum correlation. 
Normalizing the coincidence detection between correlated spin-photon 
pairs and focusing on the time interval between 1.2 and 1.64 ns, we find 
a visibility of about 45+6% for the observed oscillations (Fig. 3c). 
Figure 3d shows the time-dependent coincidence measurements 
obtained by applying a 3/2-pulse before the measurement/prepara- 
tion pulse. Figure 3e in turn shows that when we focus on the same time 
interval, the oscillations for spin detection along (||) +i|t))/V2 
(Fig. 3e) are m out of phase relative to those along (||) —i]T))/V2 
(Fig. 3c) and have a visibility of 47 + 4%. The actual coincidence de- 
tection rate in the detection time window [1.2ns, 1.64ns] is 240 
counts per hour. Using the data from Fig. 3c, e, we determine the fide- 
lity of the quantum correlations in this rotated basis to be the average 
of these two visibilities, yielding F,=0.46+0.04 (Supplementary 
Information). The overall measured entanglement fidelity is then 
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F>(F\+F,)/2=0.67+0.05, limited predominantly by the finite 
jitter of the SSPD. 

A major limitation in the demonstration of quantum correlations 
between the electron spin and the photon frequency stems from the 
relatively short ground-state spin decoherence time, T3, of the electron 
spin, originating from the slowly fluctuating nuclear spin environ- 
ment. The decay of the oscillations as a function of tf, — tf, in con- 
ditioned single-photon detection events (Fig. 3b, d) constitutes a 
measurement of the electron spin decoherence that was previously 
carried out using Ramsey interferometry”. By fitting the decay of 
coincidence measurements in the time window [0 ns, 1.64 ns] we find 
that T; =1.1+0.2 ns for the data in Fig. 3b and that T; =0.9+ 0.2 ns 
for the data in Fig. 3d, for this quantum dot. 

In our experiments, the extraction efficiency of the spin—photon 
entanglement (that is, the fraction of emitted photon collected by 
the objective) is ~0.1. We emphasize, however, that near-deterministic 
generation of entanglement is possible using InGaAs quantum dots: 
several groups have demonstrated that quantum dot emissive pro- 
perties can be drastically improved by incorporating them in tailored 
photonic nanostructures’*°”*. The fraction of photons emitted at the 
zero-phonon line typically exceeds 90% in self-assembled quantum 
dots; enhanced radiative recombination rate in devices with Purcell 
factors much greater than one ensures that the fraction of photons 
emitted at the zero-phonon line approaches unity*. By embedding a 
quantum dot in a micropillar cavity yielding a Purcell factor exceeding 
one, it is also possible to ensure single-photon collection efficiency 
approaching unity”*. 

Our results realize the long-standing goal of establishing a light- 
matter quantum interface in optically active quantum dots in GaAs- 
based materials”'’. Integration of self-assembled or other optically 
active quantum dots with spin qubits in electrically gated quantum 
dots’* would allow entangled spin—photon pairs to be used to entangle 
distant spin qubits. The relatively short T[ observed in our experiment 
presents a limitation on the separation of distant spin qubits that can 
be entangled. We emphasize that methods for narrowing the fluc- 
tuating Overhauser field’’ could be used to increase T>. Implementing 
such a scheme before spin-photon entanglement generation or using 
spin-echo techniques would extend the distance limit set by T;. Obser- 
vation of indistinguishable photons from two distant, self-assembled 
quantum dots in a Hong-Ou-Mandel interferometer, which was 
demonstrated experimentally**”’, constitutes an additional key step 
towards realization of distant-spin entanglement. We emphasize that 
coincidence detection in a Hong-Ou-Mandel set-up that signals 
successful entanglement swapping can be implemented in a straight- 
forward way using photons from two spin-frequency entangled pairs”. 


METHODS SUMMARY 


Experimental set-up. Two time-delayed 4-ps pulses from a mode-locked laser are 
combined with a 5-ns pulse and a 1.2-ns pulse obtained from a continuous-wave 
laser using an electro-optic modulator, and the resulting single beam is sent to the 
quantum dot. The relative delay of the 4-ps pulses can be changed with a trans- 
lation stage. A polarizer and a quarter-wave plate ensure that the lasers are o* - 
polarized at the quantum dot. Emitted quantum dot photons are detected by an 
APD and a SSPD through two collection paths. In each path, a combination of a 
polarizer and a spectrometer suppresses the reflected laser background. A Fabry- 
Pérot filter is used to select the colour of the entanglement photon for the compu- 
tational basis measurements and is removed for the rotated basis measurements. 
Synchronization. In the experiment, the fixed repetition rate of a mode-locked 
laser is used as a reference to synchronize the time delay of experimental pulses and 
also to synchronize the time of detection events. The repetition rate is measured 
using a fast photodiode (40-ps rise time). The 32nd harmonic of the repetition rate 
is extracted with a band-pass filter centred at 2.4 GHz, amplified (70 dB gain) and 
then fed into the clock input of a pulse-pattern generator (PPG), which is used to 
synchronize both the pulse generation and the detection circuitries. An elec- 
trical pulse output from the PPG is used to drive an electro-optic modulator, 
modulating a single-mode continuous-wave laser and ensuring that the spin mea- 
surement/preparation pulses and entanglement-generation pulses are always in a 
fixed time order. Another output of the PPG is used to pre-scale the 2.4-GHz 
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synchronization signal, before being fed into the clock input of a time-correlated 
single-photon-counting module. This module records the time delay of photon 
arrival events (via the APD or the SSPD) relative to the pre-scaled clock. 
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Spontaneous motion in hierarchically assembled 


active matter 


Tim Sanchez!*, Daniel T. N. Chen’*, Stephen J. DeCamp’, Michael Heymann!? & Zvonimir Dogic’ 


With remarkable precision and reproducibility, cells orchestrate 
the cooperative action of thousands of nanometre-sized molecular 
motors to carry out mechanical tasks at much larger length scales, 
such as cell motility, division and replication’. Besides their bio- 
logical importance, such inherently non-equilibrium processes 
suggest approaches for developing biomimetic active materials 
from microscopic components that consume energy to generate 
continuous motion’~*. Being actively driven, these materials are 
not constrained by the laws of equilibrium statistical mechanics 
and can thus exhibit sought-after properties such as autonomous 
motility, internally generated flows and self-organized beating*’. 
Here, starting from extensile microtubule bundles, we hierarchically 
assemble far-from-equilibrium analogues of conventional polymer 
gels, liquid crystals and emulsions. At high enough concentration, 
the microtubules form a percolating active network characterized 
by internally driven chaotic flows, hydrodynamic instabilities, 
enhanced transport and fluid mixing. When confined to emulsion 
droplets, three-dimensional networks spontaneously adsorb onto 
the droplet surfaces to produce highly active two-dimensional 
nematic liquid crystals whose streaming flows are controlled by 
internally generated fractures and self-healing, as well as unbinding 
and annihilation of oppositely charged disclination defects. The 
resulting active emulsions exhibit unexpected properties, such as 
autonomous motility, which are not observed in their passive ana- 
logues. Taken together, these observations exemplify how assem- 
blages of animate microscopic objects exhibit collective biomimetic 


Microtubules 
eo @F 
Kinesin clusters 


properties that are very different from those found in materials 
assembled from inanimate building blocks, challenging us to 
develop a theoretical framework that would allow for a systematic 
engineering of their far-from-equilibrium material properties. 

We assembled active materials from microtubule filaments, which are 
stabilized with the non-hydrolysable nucleotide analogue GMPCPP, 
leading to an average length of 1.5 1m. Bundles were formed by adding 
a non-adsorbing polymer—poly(ethylene glycol) or PEG—which 
induces attractive interactions through the well-studied depletion 
mechanism. To drive the system far from equilibrium, we added bio- 
tin-labelled fragments of kinesin-1, a molecular motor that converts 
chemical energy from ATP hydrolysis into mechanical movement 
along a microtubule®. Kinesins were assembled into multi-motor clus- 
ters by tetrameric streptavidin, which can simultaneously bind and 
move along multiple microtubules, inducing inter-filament sliding 
(Fig. 1a). In this respect, our experiments build upon important earlier 
work that demonstrated the formation of asters and vortices in net- 
works of unbundled microtubules and kinesin’. However, compared 
to these dispersed networks, the proximity and alignment of depletion- 
bundled microtubules greatly increases the probability of kinesin clus- 
ters simultaneously binding and walking along neighbouring filaments, 
thus enhancing the overall activity. 

Motor-induced sliding of aligned microtubules depends on their 
relative polarity. Kinesin clusters generate sliding forces between 
microtubules of opposite polarity, whereas no sliding force is induced 
between microtubules of the same polarity’. To study the dynamics 


Figure 1 | Active microtubule 
networks exhibit internally 
generated flows. a, Schematic 
illustration of an extensile 
microtubule—kinesin bundle, the basic 
building block used for the assembly 
of active matter. Kinesin clusters exert 
inter-filament sliding forces, whereas 
depleting PEG polymers induce 
microtubule bundling. b, Two 
microtubule bundles merge and the 
resultant bundle immediately extends, 
eventually falling apart. Time interval, 
5s; scale bar, 15 ttm. c, Ina 
percolating microtubule network, 
bundles constantly merge (red 
arrows), extend, buckle (green dashed 
lines), fracture, and self-heal to 
produce a robust and highly dynamic 
steady state. Time interval, 11.5 s; scale 
bar, 15 ttm. d, An active microtubule 
network viewed on a large scale. 
Arrows indicate local bundle velocity 
direction. Scale bar, 80 pm. 
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of active bundles, we confined a dilute suspension of microtubule 
bundles to quasi-two-dimensional (2D) chambers. No sliding is 
observed for isolated bundles, indicating that they are locally polarity 
sorted. However, owing to passive diffusion, quasi-static bundles occa- 
sionally encounter each other and join asa result of attractive depletion 
interactions. Merging bundle domains are equally likely to have the 
same or opposite polarity, the latter case resulting in their relative 
sliding and even bundle disintegration (Fig. 1b). In one case, two 
polarity-sorted bundles met with an initial anti-parallel orientation 
and promptly slid off each other. Subsequently, they rotated by 180° 
and rebound with relative orientation opposite to their original, and 
this time no sliding occurred (Supplementary Video 1). Overall, the 
active microtubule bundles almost exclusively exhibited extensile 
behaviour, in contrast to actin/myosin bundles, which preferentially 
contract'*. Consequently, as we show next, gels constituted from exten- 
sile microtubule bundles are strikingly different from actin/myosin gels, 
which exhibit bulk contraction’*"°. 

Increasing the concentration of extensile microtubule bundles led to 
the formation of percolating bundled active networks (BANs) with 
unique properties (Methods). For example, in contrast to classical 
polymer gels, which respond only passively to externally imposed 
stresses, BANs exhibited internally generated fluid flows (Fig. 1d, 
Supplementary Video 2). These highly robust spatiotemporally chaotic 
flows occurred throughout millimetre-sized samples and can persist 
for up to 24h, limited only by the available chemical fuel (ATP) pro- 
duced through the regeneration system. To gain insight into the micro- 
scopic dynamics that drive the formation of such patterns, we visualized 
constituent microtubule bundles at high magnification. Similar to the 
dilute case, we observed extension of bundles due to internal polarity 
sorting. However, in contrast to the dilute case, the extending micro- 
tubule bundles were now connected to a viscoelastic network. This 
caused them to buckle at a critical length scale’? and subsequently frac- 
ture into smaller fragments which quickly recombined with surrounding 
bundles of random relative polarity, generating further extension. 

Thus, BAN dynamics are determined by cycles of microtubule 
bundles undergoing polarity sorting, extension, buckling, fracturing 
and subsequent recombination (Fig. 1c, Supplementary Video 3). The 
emergent patterns, involving thousands of microtubule bundles, 
are qualitatively similar to those observed in other non-equilibrium 
systems such as swarming bacteria or swimming spermatozoa'*”’. 
Theoretical coarse-grained models of extensile rod suspensions predict 
the occurrence of hydrodynamic instabilities that may underpin such 
large-scale collective effects’. Being constructed from the bottom up, 
the properties of BANs are easily controlled and optimized. For 


example, kinesin walking velocity is tuned by ATP concentration, 
allowing us to examine how active flows depend on the underlying 
microscopic dynamics (Supplementary Video 4)°. 

To quantify fluid flow, we embedded micrometre-sized beads into 
the BANs (Fig. 2a, Supplementary Video 5). The beads were coated 
with a polymer brush to suppress their depletion-induced binding to 
microtubule bundles. In the absence of ATP, the tracer particles 
probed a passive viscoelastic network, and their mean square displace- 
ment (MSD) was sub-diffusive (Fig. 2b). At intermediate ATP con- 
centrations, MSDs became super-diffusive at longer timescales, while 
remaining sub-diffusive at shorter timescales; indicating that the beads 
were being advected by internally generated flows. Similar effects were 
observed for beads suspended in active bacterial baths”. At saturating 
ATP concentrations, MSDs were essentially ballistic; that is, beads 
moved along straight paths. At longer timescales, all MSDs should 
become diffusive; however, this requires measuring long trajectories 
that are not accessible to our imaging system. 

We also examined the spatial structure of internally generated 
fluid flow by calculating the equal-time velocity—velocity correlation 
function between bead pairs, <V(R)eV(0)>, where R is their lateral 
separation (Fig. 2c). The correlation functions decayed exponentially 
and become anti-correlated at approximately 200 |im. The decay of 
<V(R)eV(0)> taken at different ATP concentrations collapsed onto a 
universal curve when rescaled by the peak velocity <V(0)*>, revealing 
that the spatial structure of the internally generated fluid flow is inde- 
pendent of kinesin velocity (Fig. 2c). For saturating ATP concentra- 
tions, the average bead velocity (~2.2 ms ') was approximately 
three times the maximum kinesin velocity’. Previous work on 
unbundled microtubule-motor mixtures described temporally tran- 
sient dynamics before settling into a quasi-static state consisting of 
either aster or vortex defects’. 

BANs are fundamentally different at both macroscopic and micro- 
scopic scales. At macroscopic scales, BANs exhibit steady-state per- 
manent flows that persist for days and are only limited by ATP 
availability and stability of the constituent proteins. At microscopic 
scales, BAN dynamics is driven by cascades of microtubule bundles 
extending, buckling, self-fracturing and self-healing. Such unique 
dynamics inherently demands the presence of a depletion agent, absent 
in previous studies. Furthermore, BANs are very robust, exhibiting the 
same behaviour over a wide range of microscopic control parameters 
(0.4% to 10% PEG, 0.5-5 mg ml ! microtubules, 10 uM to3 mM ATP, 
1.5-120 1g ml~' kinesin). Finally, as we show next, BANs under 
confinement exhibit qualitatively distinct behaviour from those of 
unbundled microtubule-kinesin networks”. 
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Figure 2 | ATP concentration controls dynamics of active microtubule 
networks. a, Tracer particles embedded in BANs indicate local fluid flow. 
Trajectories of the particles (red paths) reveal highly non-Brownian motion. 
Scale bar, 80 jum. b, Mean square displacements of tracer beads plotted as a 
function of time for different ATP concentrations (colour-coded on the figure). 
Lines indicate a diffusive exponent of 1.0 and a ballistic exponent of 2.0. 

c, Normalized spatial velocity—velocity correlation functions as a function of 
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lateral separation for varying concentrations of ATP (colour-coded on the 
figure). The velocities were determined using equal time intervals of 5 s. When 
normalized by the peak velocity <V(0)*>, the correlation functions rescale 
onto a universal curve, revealing a characteristic length scale that is 
independent of ATP concentration. Inset, bare spatial correlation functions 
reveal that average velocity depends on ATP concentration. 
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It is well known that with increasing concentration, rod-like mole- 
cules undergo a transition to a nematic liquid-crystalline phase. To 
explore this regime we created a flat 2D oil-water interface stabilized 
with a surfactant, with BANs being dispersed in aqueous phase. Over 
time extensile microtubule bundles adsorbed onto the PEG brush 
formed by the surfactant molecules, eventually covering the entire 
surface with a dense liquid-crystalline monolayer of locally aligned 
bundles. The adsorbed 2D layer constituted an active microtubule 
liquid-crystalline phase, characterized by fast streaming flows and defect 
unbinding (Supplementary Video 6). Further information about the 
nature of active microtubule liquid crystals can be garnered by exam- 
ining the structure and dynamics of their defects. In general, active liquid 
crystals can have either nematic symmetry**” (as found in monolayers 
of amoeboid cells or vertically shaken monolayers of granular rods**”’) 
or polar symmetry (which is found in motility assays at high concentra- 
tions*”*). Polar liquid crystals form vortex and aster defects’. 

In contrast, we found that active microtubule liquid crystals form 
disclination defects of charge 1/2 or -1/2, implying the presence of 
nematic symmetry (Fig. 3a-c). This is expected because the basic building 
blocks of these materials are symmetrically extensile microtubule 
bundles. In equilibrium liquid crystals, defects are largely static struc- 
tures whose presence is determined by either internal frustrations or 
external boundary conditions. In contrast, defects in active micro- 
tubule nematics exhibited unique spatiotemporal dynamics. They 
were created as uniformly aligned nematic domains extend, buckle 
and internally self-fracture (Fig. 3d), similar to the dynamical cascades 


Figure 3 | Dynamics of 2D streaming nematics confined to fluid interfaces. 
a, Schematic illustrations of the nematic director configuration around 
disclination defects of charge 1/2 (left) and —1/2 (right). b, c, Active liquid 
crystals exhibit disclinations of both 1/2 (top) and —1/2 (bottom) charge, 
indicating the presence of nematic order. Scale bars, 15 um. d, A sequence of 
images demonstrates buckling, folding and internal fracture of a nematic 
domain. The fracture line terminates with a pair of oppositely charged 
disclination defects (red arrow tracks 1/2 disclination; blue arrow tracks — 1/2 
disclination). After the fracture line self-heals, the disclination pair remains 
unbound. Time lapse, 15 s; scale bar, 20 um. 
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of microtubule bundles in 3D active networks. Once created, a fracture 
line terminated with a pair of oppositely charged disclination defects. 
Eventually, even as the fracture self-healed, the defects remained 
unbound and streamed around until eventually annihilating with 
oppositely charged defects (Supplementary Video 6). The rates of 
defect creation and annihilation were balanced, creating steady-state 
streaming dynamics that persisted for many hours. These observa- 
tions exemplify how active nematics are fundamentally different from 
equilibrium ones, in which fractures, internal streaming flows and 
spontaneous unbinding of defect pairs are never observed. 

In a biological context, active fluids are frequently confined to the 
cytoplasm and it has been proposed that such confinement leads to 
emergence of coherent flows that can enhance cellular transport, a 
phenomenon known as cytoplasmic streaming**’’. For this reason 
we encapsulated BANs in aqueous droplets emulsified in fluorinated 
oil. When squeezed between two surfaces, such water-in-oil active 
droplets exhibited an unforeseen emergent property: persistent auto- 
nomous motility. This motility was highly robust, limited only by the 
sample lifetime, which was typically a few days (Fig. 4a, Supplementary 
Video 7). Instead of moving along straight lines, the active emulsion 
droplets preferentially moved in periodic patterns, with their average 
velocities reaching up to ~1 pms. In comparison, droplets without 
any chemical fuel did not exhibit any motion (Fig. 4b). 

To elucidate the microscopic mechanism that drives droplet moti- 
lity, we imaged the internal microtubule dynamics. For small droplets 
(less than 30 tm), microtubule bundles extended and pushed against 
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Figure 4 | Motile water-in-oil emulsion droplets. a, Droplets containing 
extensile microtubule bundles exhibit spontaneous autonomous motility when 
partially compressed between chamber surfaces. A droplet trajectory taken over 
a time interval of 33 min is overlaid onto a bright-field droplet image. Scale bar 
fora and b is 80 tum. b, In the absence of ATP, passive droplets exert no internal 
forces, and the only contribution to their movement is minor drift. 

c, Fluorescence image of active microtubule bundles that spontaneously adsorb 
onto the oil-water interface. The resulting active liquid crystalline phase 
exhibits streaming flows, indicated with blue arrows. The red arrow indicates 
the direction of instantaneous droplet velocity. The image is focused on the 
droplet surface that is in contact with the coverslip. Scale bar, 100 um. d, Image 
of the droplet taken at a midplane indicates that the droplet interior is largely 
devoid of microtubule bundles. Scale bar, 100 um. 
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the water-oil interface, reaching a quasi-static state. Unable to buckle, 
they remained dispersed throughout the droplet and no motility was 
observed. In contrast, when confined within larger droplets, active micro- 
tubule bundles exhibited dramatically different behaviour. Similar to the 
flat surface, they adsorbed onto the oil-water interface, exhibiting 
coherent spontaneous flows (Supplementary Videos 8, 9). When in 
frictional contact with a hard surface, these internal flows drove the 
motility of the entire droplet (Fig. 4c). Scanning through the bulk of 
these motile droplets revealed that their interiors are largely devoid of 
microtubule bundles relative to their surface (Fig. 4d). The critical 
droplet size required for the formation of active liquid crystals was 
roughly similar to the collective buckling length scale that characterizes 
flows of 3D BANs. Spontaneous flows that drive droplet motility bear 
resemblance to cytoplasmic streaming observed in Drosophila oocytes”°. 
Equilibrium nematics confined to spherical surfaces exhibit inter- 
esting defect configurations that have been proposed as building blocks 
for assembly of higher-order equilibrium structures’. Our experi- 
ments provide a new opportunity to explore how dynamic defects of 
active nematics on spheres can be harnessed to control droplet moti- 
lity. At the next level of hierarchy, they also enable future studies in 
which emulsions are used for studying synchronization, crystallization 
and jamming of self-propelled spheres at high concentrations”. 
Conventional polymer gels, liquid crystals and emulsions are quint- 
essential systems of soft condensed-matter physics. Studies of such 
materials have resulted in numerous fundamental advances and tech- 
nological applications. However, the properties of these materials are 
limited by the laws of equilibrium statistical mechanics. Here, starting 
with simple extensile bundles and culminating in the assembly of motile 
droplets, we have demonstrated the hierarchical assembly of active gels, 
liquid crystals and emulsions. Liberated from equilibrium constraints, 
the tunable emergent properties of such far-from-equilibrium materials 
are strikingly different from their passive analogues, enabling further 
development of the statistical mechanics of active matter. 


METHODS SUMMARY 


Microtubules were polymerized to a concentration of 6 mg ml’ with GMPCPP. The 
non-hydrolysing analogue of GTP reduces the microtubule nucleation barrier, re- 
sulting in very short filaments. Left at room temperature, the filaments anneal end- 
to-end, slowly increasing in average length. The behaviour of active microtubule 
networks is very sensitive to the length distribution of constituent filaments. After 
polymerization, microtubules used for active gel experiments were left at room tem- 
perature for two days, while microtubules used to assemble active 2D nematics were 
left for one. They had average lengths of 1.5 jum and 1.5 jm, respectively. Biotin- 
labelled kinesin fragments (401 amino acids of the amino-terminal motor domain of 
Drosophila melanogaster kinesin) were purified from Escherichia coli. Motor clusters 
were assembled by mixing biotin-labelled K401 with tetrameric streptavidin at a 
molar ratio of 1.7:1. Active mixtures consisted of microtubules, kinesin clusters, 
depleting polymer (PEG, 20 kDa), anti-bleaching agents, and an ATP regenerating 
system. The regenerating system used phosphoenol pyruvate (PEP) and pyruvate 
kinase/lactate dehydrogenase (PK/LDH) to maintain the ATP concentration at a 
constant level, allowing us to tune the kinesin velocity by controlling the initial 
ATP concentration. At the highest PEP concentrations, the regenerating system kept 
samples active for tens of hours. Samples were observed in a conventional flow cell 
and surfaces were coated with a repulsive polyacrylamide brush to suppress adsorp- 
tion of proteins onto walls due to depletion or other non-specific interactions. The 
behaviour of active samples was monitored by visualizing microtubules with fluo- 
rescence microscopy, and by tracking 3-j1m beads with bright-field microscopy. 
Beads were coated with poly-L-lysine-poly(ethylene glycol) (PLL-PEG) block copo- 
lymers, which suppressed their sticking to the microtubule bundles. Emulsions were 
made with a PFPE-PEG-PFPE surfactant (E2K0660) inHFE7500 oil. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Tubulin purification and microtubule polymerization. Tubulin was purified 
from bovine brain through two cycles of polymerization-depolymerization in 
high-molarity 1,4-piperazinediethanesulphonic (PIPES) buffer**. The purified 
protein had a concentration of 7.4mg ml! in M2B buffer (80mM _ PIPES, 
pH6.8, 1mM EGTA, 2mM MgCl) and was stored at —80°C. Tubulin was 
subsequently recycled and flash-frozen at a concentration of 13.2mgml~' in 
liquid nitrogen using thin-walled polymerase chain reaction (PCR) tubes. For 
fluorescence microscopy, tubulin was labelled with Alexa Fluor 647 (Invitrogen, 
A-20006) by a succinimidyl ester linker; absorbance spectroscopy indicated that 
29% of monomers were labelled**. Recycled tubulin was copolymerized with 
fluorescently labelled tubulin, producing microtubules with 3% of labelled mono- 
mers. The polymerization mixture consisted of 4.5 pil (13.2 mg ml‘) unlabelled 
tubulin, 0.9 pl (7.4 mg ml ’) Alexa-647-labelled tubulin, 1.4 ul glycerol (to 15% of 
final volume), 0.56 pl GMPCPP (10 mM)* (Jena Biosciences, NU-4056), 0.47 pl 
dithiothreitol (DTT, 20 mM), and M2B buffer to a final volume of 9.4 tl (for a final 
tubulin concentration of 6.8 mgml'). The suspension was incubated for 30 min 
at 37°C, and subsequently diluted to 6mgml | with M2B. Microtubules were 
annealed at room temperature for 2 days before using them for mixing experi- 
ments. This resulted in an average microtubule length of 1.5 jum. 
Kinesin-streptavidin complexes. K401, which consists of 401 amino acids of the 
N-terminal motor domain of D. melanogaster kinesin, was purified as previously 
published®’, The protein was frozen at 0.7mg ml’ in 50 mM imidazole (pH 6.7), 
4mM MgCl, 2 mM DTT, 50 tM ATP and 36% sucrose buffer. Kinesin-streptavidin 
complexes were assembled by mixing 7 ul of freshly thawed K401 solution con- 
taining 3 mM DTT with 8 il of 0.35 mg ml ' streptavidin (Invitrogen, S-888). The 
mixture was incubated on ice for at least 10 min before diluting with M2B to a final 
volume of 28 Ll. 

Polymer-coated sterically repulsive microspheres. Coating microspheres with a 
polymer brush leads to a steric repulsion that prevents their adsorption onto sur- 
faces*’. Negatively charged polystyrene beads were coated with a block-copolymer 
(PLL-PEG), consisting of poly-L-lysine backbone and poly(ethylene glycol) side 
chains. The positively charged lysine groups electrostatically attach to the bead 
surface, leaving the PEG chains extending off the surface. PLL-PEG is synthesized 
by conjugating N-hydroxysuccinimidyl ester of methoxypoly(ethylene glycol) pro- 
pionic acid (Laysan Bio, MPEG-SCM-20K- 1g, molecular mass 20 kDa) with poly- 
G-lysine (Sigma, P7890-100MG, molecular mass 20 kDa) in 50 mM sodium borate 
buffer, pH 8.5. The concentration of lysine amino acids is 10 mM and the concen- 
tration of 20-kDa PEG molecules is 2.86 mM. This mixture is incubated at room 
temperature with gentle mixing for an hour and is subsequently transferred into a 
dialysis membrane (pore size 5 kDa). It is dialysed two times against Millipore- 
purified H,O each time for more than 8 h. 

PLL-PEG block-copolymer was added to 3-1m polystyrene beads decorated 
with sulphate groups in a low salt buffer. The final mixture containing 10 pil PLL- 
PEG, 100 pl HEPES (100 mM, pH 7.5), 12 tl bead stock (8% w/v) and 878 pl H,0 
was incubated for 1 h, stirred gently and sonicated every 20 min. Subsequently, the 
beads were centrifuged for 5 min (5,000g) and resuspended in M2B buffer at 
~1.3% w/v. Beads were imaged using bright-field microscopy and their positions 
were tracked using well-established protocols**. 

Observation chambers for optical microscopy. Microscope glass slides and 
coverslips were coated with a polyacrylamide brush to prevent non-specific adsorp- 
tion of protein’. Slides were first rinsed and sonicated with hot water containing 
0.5% detergent, then with ethanol, and finally with 0.1 M KOH. Subsequently, the 
slides were soaked in a mixture of 98.5% ethanol, 1% acetic acid, and 0.5% of the 
silane-bonding agent 3-(trimethoxysilyl) propylmethacrylate (Acros Organics, 
216551000) for 15 min. Slides were rinsed a final time and immersed in a 2% w/v 
aqueous solution of acrylamide. 35 ul per 100 ml of tetramethylethylenediamine 
(TEMED) and 70 mg per 100ml of ammonium persulphate were added to the 
acrylamide solution to promote polymerization of the polyacrylamide brush that 
is covalently attached to the glass surfaces. Slides and coverslips were stored in a 
suspension of polymerized acrylamide and used for up to two weeks. Each slide was 
rinsed and air-dried immediately before use. 

Assembly of active microtubule bundles. To study the dynamics of dilute micro- 
tubule bundles we confined a low concentration of microtubules to a quasi-2D 
observation chamber. In the absence of molecular motors, depletion interaction 
causes formation of microtubule bundles with mixed polarity. Introducing kinesin 
clusters induces relative sliding and eventual disintegration of bundles in which 
microtubules have mixed polarity while leaving intact bundles in which microtu- 
bules have the same polarity. This leads toa steady state that consists only of bundles 
with the same polarity. In the dilute limit (0.25 mg ml ') the average bundle length 
is 9 um and the average cross-section contains approximately 10-20 microtubule 
bundles. Similar bundle thicknesses are measured in 3D bundled active networks. 
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Active microtubule networks. Several initial mixtures were prepared separately 
and then combined into the final solution. Individual components were dissolved 
and frozen separately in M2B buffer, and then thawed freshly for use. We included 
anti-oxidant components (listed below) and trolox (Sigma, 238813) to avoid photo- 
bleaching during fluorescence imaging. Anti-oxidant solution 1 (AO1) contained 
15mgml ' glucose and 2.5M DTT. Anti-oxidant solution 2 (AO2) contained 
10mg ml * glucose oxidase (Sigma G2133) and 1.75mgml ° catalase (Sigma, 
C40). Second, a high-salt M2B solution (69 mM MgCl) was included to raise the 
MgCl,. Lastly, ATP-regenerating components, including enzyme mixture pyruvate 
kinase/lactate dehydrogenase (PK/LDH, Sigma, P-0294) and phosphoenol pyru- 
vate (PEP) were included in the final mixture. Because ATP is hydrolysed by kinesin 
activity, the PK/LDH uses PEP as a fuel source to convert ADP back into ATP ata 
rate that is much faster than the ATP hydrolysis. Thus constant ATP concentration 
is maintained throughout the experiments until all the PEP is exhausted”. 

The components described above were mixed without microtubules and in 

large volumes to reduce pipetting inaccuracies, while not wasting the valuable 
tubulin. The ‘active pre-mixture’ contained the following: 1.31 AOI, 1.3 pl 
AO2, 1.7 ul ATP (from stocks with different ATP concentrations), 1.7 pl PK/ 
LDH, 2.9 ul high-MgCl,-M2B, 3 ul of sterically repulsive beads (3 um beads, 
~1.3% w/v), 6p trolox (20mM), 8 pl PEP (200mM), 8 pl PEG (6% w/w in 
M2B), 4 Wl kinesin-streptavidin mix, and 12.1 tl M2B buffer. To prepare the final 
active mixture, 5 1l of the active pre-mixture was mixed with 1 1] of the 6 mg ml! 
microtubule solution, resulting in a final microtubule concentration of | mg ml‘. 
2D active microtubule nematics at oil-water interface. A 90-um polyacryla- 
mide flow cell was constructed as described above. HFE7500 oil with 2% (v/v) 
PFPE-PEG-PFPE surfactant was flowed into the cell and immediately displaced by 
flowing in the active microtubule mixture. The oil preferentially wets the interface, 
leaving a perfectly stable flat 2D oil-water interface decorated with the surfactant. 
The BANs were allowed to adsorb onto the oil-water interface and then imaged 
with fluorescent microscopy. Experiments indicate that the PEG is necessary for 
the adsorption of the microtubule bundles onto the surfactant monolayer and that 
active nematics assemble on flat as well as curved surfaces. For these experi- 
ments, 1-day-old microtubules are used, resulting in tighter nematic defects. 
Furthermore, the presence of surfactant is essential for the preservation of the 
streaming state and adsorption is only observed in active samples. In the absence of 
ATP, bundles do not adsorb onto the interface, instead forming a bulk rigid gel. 
Light microscopy. Active microtubule networks were viewed with epi-fluorescence 
microscopy (Nikon Eclipse Ti microscope). AlexaFluor 647-labelled microtubules 
were illuminated with a 120-W metal halide light source (X-cite 120) and a fluo- 
rescent filter cube (Semrock, Cy5-4040B-NTE). Images were acquired with a cooled 
charge-coupled device (CCD) camera (Andor Clara). To viewa large sample area, we 
used the motorized stage and the Ti microscope’s Perfect Focus system to acquire 
adjacent fields of view, which were subsequently stitched together using a MatLab 
routine. 3-p1m tracer particles were imaged with standard bright-field microscopy, 
and particle tracking was performed with Matlab software*. Particle image veloci- 
metry of the bundle flows was performed using the freely available Matlab-based 
package PIVlab (version 1.11). 
Active mixtures in aqueous emulsion droplets. The active microtubule network in 
aqueous suspension was mixed with HFE7500 oil at a ratio of 1:5 and vortexed briefly 
five times to create water droplets within the oil medium. The aqueous droplets were 
stabilized with a 2% v/v solution of PFPE-PEG-PFPE surfactant (E2K0660)*’. Glass 
slides and cover slips were cleaned using the protocol previously described for pre- 
paration of polyacrylamide-coated slides. The emulsion was co-flowed into the micro- 
scope chamber with extra HFE7500 oil to disperse the droplets. Within the droplets, 
the dynamics of microtubule bundles were viewed with fluorescence microscopy, while 
the motion of the droplets themselves was examined with bright-field microscopy. 
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Little change in global drought over the past 60 years 


Justin Sheffield', Eric F. Wood! & Michael L. Roderick? 


Drought is expected to increase in frequency and severity in the 
future as a result of climate change, mainly as a consequence of 
decreases in regional precipitation but also because of increasing 
evaporation driven by global warming’. Previous assessments of 
historic changes in drought over the late twentieth and early 
twenty-first centuries indicate that this may already be happening 
globally. In particular, calculations of the Palmer Drought Severity 
Index (PDSI) show a decrease in moisture globally since the 1970s 
with a commensurate increase in the area in drought that is attri- 
buted, in part, to global warming*’. The simplicity of the PDSI, 
which is calculated from a simple water-balance model forced by 
monthly precipitation and temperature data, makes it an attractive 
tool in large-scale drought assessments, but may give biased results 
in the context of climate change®. Here we show that the previous- 
ly reported increase in global drought is overestimated because 
the PDSI uses a simplified model of potential evaporation’ that 
responds only to changes in temperature and thus responds incor- 
rectly to global warming in recent decades. More realistic calcul- 
ations, based on the underlying physical principles® that take into 
account changes in available energy, humidity and wind speed, 
suggest that there has been little change in drought over the past 
60 years. The results have implications for how we interpret the 
impact of global warming on the hydrological cycle and its 
extremes, and may help to explain why palaeoclimate drought 
reconstructions based on tree-ring data diverge from the PDSI- 
based drought record in recent years”. 

Drought is a major natural hazard that can have devastating impacts 
on regional agriculture, water resources and the environment, with far- 
reaching impacts in an increasingly globalized world"’. The perceived 
abundance of drought in the 2000s, such as long-term events in the 
western United States, southeast Australia and northeast China, and 
the recent short-term but severe events in Russia and the central United 
States, hint that climate change may be a forcing factor and this is only 
likely to get worse, given projected climate warming and precipitation 
changes for the twenty-first century’*. Many studies have attributed 
the severity and length of recent droughts to global warming””*. The 
Fourth Assessment Report (AR4) of the Intergovernmental Panel on 
Climate Change (IPCC) summarized the evidence in the following 
terms: “More intense and longer droughts have been observed over 
wider areas since the 1970s, particularly in the tropics and subtropics. 
Increased drying linked with higher temperatures and decreased 
precipitation has contributed to changes in drought’”’*. The AR4 drew 
heavily from historic analyses of the PDSI, which shows an increase 
in drought in the last few decades, regionally*"* and globally’, that is 
commensurate with the increase in global temperatures. The AR4 sum- 
mary has been substantially revised, however, in the recent IPCC 
Special Report on Extremes’ that notes the over-reliance on the PDSI 
and possible overestimation of the increase in regional and global 
drought. 

The PDSI was developed originally as an agricultural monitoring 
tool in the United States in the 1960s’° that helped in allocating aid to 
stricken farmers. It is used pervasively for operational monitoring and 
increasingly in studies of climate change””’. Its popularity stems from 


its ease of use and long history in agricultural applications. However, 
the PDSI has several shortcomings because of its simplicity*’”"*, 
including the treatment of potential evaporation (PE, the evaporative 
demand of the atmosphere), which is calculated from temperature data 
by using the empirical Thornthwaite equation’. It has been well estab- 
lished that evaporation is a function of more than just temperature, 
and the correct physics includes radiative and aerodynamic controls 
on evaporative demand*'**, Temperature-based PE methods appa- 
rently perform relatively well in climatological applications because air 
temperature is correlated with net radiation and humidity at weekly, 
monthly and subannual timescales”. However, estimating trends is 
problematic and there is extensive literature showing that temper- 
ature-based methods are flawed, inherently because the temperature 
state does not uniquely determine the evaporative flux (see Sup- 
plementary Information). In the context of climate change, the tem- 
perature-based approach responds to recent observed warming with 
increasing PE. When used in the PDSI model, which derives soil 
moisture from the balance between precipitation, evaporation and 
runoff, the increase in PE drives an increase in drought globally’* in 
addition to the impact of any changes in precipitation. However, 
numerous studies based on observations and detailed physical model- 
ling have shown regional declines in evaporative demand over past 
decades as a result of various combinations of declining radiation, va- 
pour-pressure deficit and/or wind speed’, despite generally increasing 
regional temperatures. 

To resolve this discrepancy and provide an improved estimate of 
changes in global drought over the past 60 years based on better 
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Figure 1 | Global average time series of the PDSI and area in drought. 

a, PDSI_Th (blue line) and PDSI_PM (red line). b, Area in drought (PDSI 
<-—3.0) for the PDSI_Th (blue line) and PDSI_PM (red line). The shading 
represents the range derived from uncertainties in precipitation (PDSI_Th and 
PDSI_PM) and net radiation (PDSI_PM only). Uncertainty in precipitation is 
estimated by forcing the PDSI_Th and PDSI_PM by four alternative global 
precipitation data sets. Uncertainty from net radiation is estimated by forcing 
the PDSI_PM with a hybrid empirical-satellite data set?’ and an empirical 
estimate. The other near-surface meteorological data are from a hybrid 
reanalysis—observational data set*'. The thick lines are the mean values of the 
different PDSI data sets. The time series are averaged over global land areas 
excluding Greenland, Antarctica and desert regions with a mean annual 
precipitation of less than 0.5mmd_. 
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physics, we calculate global changes in the PDSI using two methods for 
PE. We use the Thornthwaite algorithm and a physically based estim- 
ate based on the currently accepted Penman-Monteith (PM) equa- 
tion?’ forced by our global meteorological data set and a set of 
alternative precipitation and net radiation data sets (see Methods). 
Because the PM is a more accurate, comprehensive and physically 
based model of PE (see Supplementary Information), it also has greater 
data requirements, which until recently has precluded its use at large 
scales. However, the increasing availability of detailed meteorological 
data from gridded ground observations, satellite remote sensing and 
atmospheric model reanalyses now makes it possible to calculate 
improved estimates that take into account radiative and aerodynamic 
controls. Recent studies have claimed that there is little difference 
between the PDSIs that use the Thornthwaite and PM algorithms 
(PDSI_Th and PDSI_PM, respectively)**** but this can be attributed 
to inconsistencies in the forcing data sets and simulation configuration 
(see Supplementary Information). 

The global averaged time series of PDSI using the two PE methods 
(Fig. 1a) clearly shows the decreasing trend in the PDSI_Th since the 
1970s but not for the PDSI_PM. Uncertainty due to precipitation and 
net radiation is estimated using four different global precipitation data 
sets and two different net radiation data sets, and is represented by the 
shading. The trend for 1950-2008 is —0.037 + 0.004 yr_' (P<0.01) 
and —0.018 + 0.005 yr! (P <0.01) for the PDSI_Th and PDSI_PM, 
respectively. The more recent trend over 1980-2008 is similar for the 
PDSI_Th (—0.032 + 0.008 yr_'; P< 0.01) but is essentially zero for 
the PDSI_PM (0.003 + 0.018 yr’; not significant). The global area in 
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Figure 2 | Trends in the PDSI and PE. a, c, e, Non-parametric trends for 1950- 
2008 in annual average PDSI (averaged over the results using the four precipitation 
data sets and, for the PDSI_PM, also over the two net radiation data sets) from the 
PDSI_Th (a) and the PDSL_PM (c), and their difference (e). b, d, f, Non-parametric 
trends for 1950-2008 in annual average PE from the Thornthwaite equation 
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drought (Fig. 1b) from 1980 to 2008 increases by 0.559 + 0.117% per 
year (P < 0.01) for the PDSI_Th and increases by 0.078 + 0.205% per 
year (not significant) for the PDSI_PM. Despite the smaller overall 
trend in PDSI_PM, there is still an increase in drought area, although 
this is seven times smaller than the PDSI_Th drought area trend. The 
spatial distribution of trends in the PDSI_Th (Fig. 2) shows drying 
across much of the global land, particularly over Africa and eastern 
Asia. In contrast, the PDSI_PM shows a mixture of drying and wetting 
that combines to give a smaller trend globally. The equivalent trends 
in the PE_Th are increasing everywhere (98% of land area), as expected 
given the global increase in temperature, but are a mixture of in- 
creases (58% of land area) and decreases (42%) for the PE_PM. The 
two methods disagree in the sign of the trends across much of northern 
South America, Central America, eastern North America, eastern sub- 
Saharan Africa, western Russia, southern and southeast Asia, and 
Australia. Because some of these regions are water-limited, the impact 
on actual evaporation, and therefore the PDSI, is small. However, in 
energy-limited regions such as northern Eurasia and the Amazon, the 
differences in PE translate into differences in the sign of the PDSI 
trends. 

The results show that previous calculations of the increase in global 
drought are overestimated. However, there are several sources of un- 
certainty in our approach, not least from the errors in the meteoro- 
logical data. We use contemporary data, which are the best that are 
currently available globally, but we recognize that they are not perfect. 
Nevertheless, the regions of decreasing PE trends estimated with the 
PM model are generally in agreement with the abundance of evidence 


Trend (mm d1 yr) 


Deep ee ee oe ee 


-0.02 -0.01 0 0.01 0.02 


(b) and the PM equations (d), and their difference (f). Values are not shown for 
Greenland, Antarctica and desert regions with a mean annual precipitation of less 
than 0.5mm d_. Statistically significant trends at the 95% level are indicated by 
hatching. The difference in trends in e and f and its statistical significance are 
calculated from the time series of differences between the two data sets. 
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of decreasing pan evaporation for many regions’, which has been attri- 
buted to global dimming, decreases in wind speed and in some places 
decreasing vapour-pressure deficit”® (see Supplementary Information 
for further discussion). The temperature-based algorithm does not cap- 
ture these trends. The trends in the radiation, wind-speed and humidity 
data used to calculate the PE_PM are generally consistent with regional 
observations, although increasing trends in downward longwave radi- 
ation and decreasing trends in wind speed from sparse monitoring 
networks are generally underestimated by the global data set. However, 
the possible underestimation of trends in downward longwave radi- 
ation cannot explain the difference between the PDSI_Th and PDSI_PM, 
and underestimation of wind-speed decline implies that the difference is 
probably a conservative estimate (see Supplementary Information). 
Furthermore, uncertainty due to the data on precipitation and net radi- 
ation as estimated from the standard deviation of PDSI trends for dif- 
ferent precipitation and net radiation data sets is much smaller than the 
difference in PDSI trends due to the PE method. As these data sets 
improve and the uncertainties are reduced, the magnitude of the calcu- 
lated trends in drought extent will change; however, the conclusion—that 
poor physical representation of potential evapotranspiration induces 
untenable estimates of long-term changes in drought—will remain. 

Despite the long-standing consensus that the underlying science for 
temperature-based estimates of PE is flawed, compounded by the re- 
sults of this and other studies**'”’ that the flaws are manifested in 
errors in the estimations of the impact of warming on drought and 
hydrology in general, the reasons for the long and continued use of the 
PDSI_Th for climate studies in essentially its original form are a curi- 
osity. The arguments justifying its use are generally based on the 
availability of data, but they are also probably related to its traditional 
use for mapping agricultural drought and allocating drought aid, in 
which comparisons—and thus division of aid—are based on spatial 
maps made over a short period (weeks to months). Those applications 
rely on spatial variations that are driven mainly by precipitation defi- 
cits, and so the PDSI_Th should do a reasonable job of distinguishing 
between regions of more or less drought severity at a given instant. 
However, in the assessment of long-term variations due to climate 
variability or change, a use for which the PDSI_Th was originally 
not designed, it seems that the over-sensitivity to changes in temper- 
ature, and other simplifications, compromises the comparisons in 
time. Palaeoclimate reconstructions of drought may be particularly 
susceptible because they are often developed by scaling tree-ring data 
to match the calculated PDSI_Th for their overlap period. For some 
regions, the tree-ring data, which reflect real variations in climatic and 
non-climatic factors (such as disturbances), diverge from the instru- 
mental-based PDSI_Th in recent decades when warming has been 
most rapid (see, for example, refs 9, 10). Similarly, the “divergence 
problem” as it relates to reconstructions of temperature from high- 
latitude and high-elevation tree-ring data may be associated with the 
assumption that temperature can be used as a surrogate for the con- 
trols on growth through variations in evapotranspiration, notwith- 
standing the competing impacts of other environmental factors (for 
example, higher concentrations of CO,). This can lead to overestima- 
tion of past changes and conversely underestimation of recent trends 
in the context of the past. 

The results of this study have implications for how we interpret the 
role of global warming in changes to the terrestrial hydrological cycle 
and its extremes, such as drought, and how we quantify the impacts of 
future climate change. Several regional studies*’* have suggested that 
higher temperatures than normal were the cause for increased drought 
in recent years through increased evaporation. Yet there is evidence 
that the direct impact of temperature on drying may actually be a 
misinterpretation of feedbacks between the land and the atmosphere. 
Itis more plausible that evaporation actually decreases during drought” 
because of less precipitation, and that drought drives increases in tem- 
peratures because there is less evaporative cooling and thus a higher 
sensible heat flux warming the air**. Short-term temperature anomalies 
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are likely to be a response to drought, rather than a factor in forcing 
drought”. Of concern is if the perceived influence of warming on 
drought as quantified by empirical approaches is extrapolated into 
the future and predictions of the impacts of climate change are likely 
to be overestimated*'”*”®. It is therefore essential to retain a perspective 
on the magnitude of impacts of global warming that is based on our 
physical understanding of the complex relationships between climate 
and hydrological variability. The use of physically realistic hydrological 
modelling merged with the wealth of in situ and satellite-based data 
sources has the potential to give better estimates of changes in global 
drought and its relationship with climate change. 


METHODS SUMMARY 


We quantify drought with the original and self-calibrating version of the PDSI 
model'*, which uses the Thornthwaite algorithm (PDSI_Th), and a modified 
version (PDSI_PM) that uses the PM formulation for PE. The two models 
(PDSI_Th and PDSI_PM) are forced with precipitation and temperature data 
from our global meteorological data set’, which combines atmospheric reanalysis 
data with available remote sensing and ground observations and has been updated 
to 1948-2008. The meteorological data are adjusted to remove spurious trends due 
to observational system changes (see Supplementary Information). We also use a 
set of alternative global precipitation data sets to evaluate the impact of uncertain- 
ties in global precipitation trends on the drought trends, recognizing that precipi- 
tation is the main driver of drought variability but that there are uncertainties in 
precipitation trends at regional to global scales. The PDSI_PM model additionally 
requires radiation, humidity and other near-surface meteorological inputs, which 
are also taken from the updated meteorological data set (see Supplementary 
Information). Trends in annual mean values are calculated by using the non- 
parametric Mann-Kendall test and given as the median value across PDSI data 
sets derived from different precipitation data sets, with the standard deviation 
given after the + sign and estimated by the scaled median absolute deviation. 
An @ value of 0.05 is used to test for significance. The area in drought is calculated 
as the percentage of land area with a PDSI of less than —3. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Palmer Drought Severity Index. Historically, the PDSI’* has been the tool of 
choice when monitoring and analysing drought occurrence, especially in the 
United States, where it is one component of the US National Drought 
Monitor”. It is generally calculated on weekly to monthly timescales and uses 
precipitation and temperature inputs to drive a simplified water-balance model 
with a generic two-layer soil model and meteorology that is normalized with a 
reference set of water balance terms. At continental to global scales its simplicity 
makes it an attractive choice for reconstructing drought records*”’, for which it 
has also been shown to be a proxy for soil moisture‘. It has been used to ana- 
lyse continental-scale to global-scale, long-term variability in drought by several 
studies (see, for example, refs 4, 34, 35). The PDSI is generally well correlated with 
output from more comprehensive hydrological modelling’® but diverges in cooler 
seasons and high latitudes, and substantially so in drier regions. 

Despite this legacy, the PDSI has been shown to be unsuitable for widespread 
application and suffers from simplifications in its physical basis and soil hydro- 
logy’’”’”. For example, its exclusion of cold-season processes makes it unsuitable 
for application in many parts of the world. The usual form of the PDSI algorithm 
uses several empirical constants to characterize the local climate. These were 
originally derived by Palmer using data from a number of climate divisions gen- 
erally located in the midwestern United States and are therefore not representative 
of the whole globe. For example, the criteria for signalling the start and end of a 
drought are arbitrarily based on original data from the midwestern United States. 
Some of the shortcomings have been addressed by a self-calibrating version’® that 
removes the spatial inconsistencies. 

We quantify drought with the original and self-calibrating version of the PDSI 

model'*, which uses the Thornthwaite algorithm (PDSI_Th), and a modified 
version (PDSI_PM) that uses the PM formulation for PE. 
Thornthwaite PE algorithm. PE is modelled in the PDSI by using the temper- 
ature-only-based Thornthwaite method. Thornthwaite’ correlated mean monthly 
temperature with PE, as determined from the water balance for valleys in the 
eastern USA, where sufficient moisture was available to maintain active transpira- 
tion. The Thornthwaite formula for monthly PE (mm) is 


PE = 16d(10T/D* 


where T is the mean temperature for the month (in °C) and dis a correction factor 
that depends on latitude and month. J is the annual thermal index, 


= 2 —/myl.514 
i= ae (11/5) 
where the subscript i refers to the month of the year and a is an empirical factor, 


a= 0.49 + 0.01791 — 0.00007711" + 0.000000675F° 


Penman-Monteith PE algorithm. The PM approach”? is generally accepted as 
the most comprehensive algorithm for modelling potential and actual evapotran- 
spiration (given additional estimates of the plant and environmental resistance to 
atmospheric demand). It is derived from consideration of the equations of the 
surface energy balance by means of elimination of the surface temperature term. It 
forms the basis for the evaporation submodel of many distributed hydrological 
and land surface models, the latter of which often form the land component of 
coupled climate models, and has been used as the basis for regional and global 
retrievals of evapotranspiration based on satellite remotely sensed data (see, for 
example, refs 38-40). The PM equation given below models the diffusion of energy 
from plants or soil against stomatal and aerodynamic resistance, given inputs of 
net radiation, temperature, humidity and wind speed: 


ET= ARpet +(pepD/Ta) 
A+y(l+rs/ta) 


where evapotranspiration (ET) is now in W m_”, which can be converted into mm 
per month by dividing by the latent heat of vaporization of water, 2 (kg '). A 
(PaK™') is the slope of the plot of saturated vapour pressure against air tem- 
perature, Rye is the net radiation (W m ”), pis the density of air (kg m-), Cp is the 
specific heat of air at constant pressure (Jkg 'K™~'), D is the vapour-pressure 
deficit (Pa) and y is the psychometric constant (Pa K_!). r, and r, are the aero- 
dynamic and stomatal resistances (sm_'), respectively. ET collapses to PE when 
the stomatal resistance is zero; the recommended form of the equation”, given as 
the sum of the radiative and aerodynamic components, is 
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Rne —” _6.43(1-+0.536U)D 
ay (+ ) 


A 


PE= —— 
At+y 


where PE is now in mmd_'; U is the wind speed (m s ') at2m height. In 
comparison with the Thornthwaite expression, which is based solely on temper- 
ature, the PM models evaporation as the combination of radiative and aero- 
dynamic processes, thus giving a more realistic estimate and having the potential 
to be influenced by changes in humidity, radiation and wind speed, as well as 
temperature. 

Global meteorological forcing dataset. The two PDSI models (PDSI_Th and 
PDSI_PM) are forced with precipitation and temperature data from our global 
meteorological data set. The PDSI_PM model additionally requires radiation, 
humidity and other near-surface meteorological inputs, which are also taken from 
this data set. The global meteorological forcing dataset*' combines reanalysis data 
and observations to form a global, long-term (1948-2008), 1.0°, 3-hourly data set 
of precipitation, surface radiation and near-surface meteorology. The data set is 
designed for forcing land-surface hydrological models and other physical models 
at large spatial (regional to global) and temporal (annual to decadal) domains. 
Thus the goal of the data set is to ensure robustness of long-term trends and 
variability. At the same time it adjusts the short-term (daily and diurnal) variations 
to match observational data where available and maintains interrelations between 
variables. The data set is based on the NCEP/NCAR reanalysis (NNR)*!, which 
provides continuous records of atmospheric and land variables from 1948 to the 
present. The reanalysis data are corrected to remove biases at diurnal to annual 
timescales by merging with observational data for precipitation, air temperature, 
and shortwave and longwave radiation. Precipitation and temperature are scaled 
to match the Climatic Research Unit (CRU) TS3.0 data set’ on a monthly time- 
scale. The diurnal cycle of precipitation is resampled from a statistical model 
derived from the Tropical Rainfall Measurement Mission (TRMM) Multi- 
Satellite Precipitation Analysis (TMPA) satellite-based data set*’. The diurnal 
range in air temperature is also adjusted to match the CRU TS3.0 data set. For 
1984-2007, the shortwave and longwave radiation are scaled to match the NASA/ 
GEWEX Surface Radiation Budget (SRB) satellite based data set** on a monthly 
timescale. The data set is available from http://hydrology.princeton.edu/data. 
pgf-php. 

Alternative precipitation data sets. A set of four global precipitation data sets is 
used to quantify the uncertainty in global precipitation. The data sets are CPC-Prec/ 
L* (ftp://ftp.cpc.ncep.noaa.gov/precip/50yr/), Global Precipitation Climatology 
Centre V4 (ref. 46) (GPCC; http://gpcc.dwd.de), Climatic Research Unit TS3.0 
(ref. 42) (http://badc.nerc.ac.uk/browse/badc/cru) and Willmott-Matsuura V2.01 
(ref. 47) (http://climate.geog.udel.edu/~climate/html_pages/download.html). 
Trend calculation. Trends in annual mean values are calculated by using the non- 
parametric Mann-Kendall test and given as the median value across PDSI data 
sets derived from different precipitation data sets, with the standard deviation 
given after the + sign and estimated by the scaled median absolute deviation. 
An @ value of 0.05 is used to test for significance. The area in drought is calculated 
as the percentage of land area with a PDSI of less than —3. 
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Slowdown of the Walker circulation driven by 
tropical Indo-Pacific warming 


Hiroki Tokinaga’, Shang-Ping Xie", Clara Deser*, Yu Kosaka* & Yuko M. Okumura® 


Global mean sea surface temperature (SST) has risen steadily over 
the past century’, but the overall pattern contains extensive and 
often uncertain spatial variations, with potentially important 
effects on regional precipitation**. Observations suggest a slow- 
down of the zonal atmospheric overturning circulation above the 
tropical Pacific Ocean (the Walker circulation) over the twentieth 
century’. Although this change has been attributed to a muted 
hydrological cycle forced by global warming*®, the effect of SST 
warming patterns has not been explored and quantified’”*. Here 
we perform experiments using an atmospheric model, and find 
that SST warming patterns are the main cause of the weakened 
Walker circulation over the past six decades (1950-2009). The 
SST trend reconstructed from bucket-sampled SST and night-time 
marine surface air temperature features a reduced zonal gradient 
in the tropical Indo-Pacific Ocean, a change consistent with sub- 
surface temperature observations®. Model experiments with this 
trend pattern robustly simulate the observed changes, including 
the Walker circulation slowdown and the eastward shift of atmos- 
pheric convection from the Indonesian maritime continent to the 
central tropical Pacific. Our results cannot establish whether the 
observed changes are due to natural variability or anthropogenic 
global warming, but they do show that the observed slowdown in 
the Walker circulation is presumably driven by oceanic rather than 
atmospheric processes. 

The Walker circulation is a defining feature of tropical climate. On 
seasonal-to-interannual timescales, it is closely tied to the east-west 
SST gradient along the Equator, as is the El Nifo-Southern Oscillation 
(ENSO). Long-term changes in the Walker circulation have recently 
been the subject of intense debate’*"'*. Most climate models in the 
Coupled Model Intercomparison Project phase 3 (CMIP3) predict a 
slowdown of the Walker circulation under global warming’ ””"”, con- 
sistent with sea level pressure (SLP) measurements during the twen- 
tieth century’**. The Walker circulation weakening relaxes prevailing 
easterly trade winds over the equatorial Pacific>* and suppresses the 
upwelling of nutrient-rich cold water’, affecting fishery stocks in the 
region. Despite these important climatic and biogeochemical influences, 
the mechanism for the observed long-term change in the Walker cir- 
culation has not been rigorously investigated. 

One major hypothesis attributes the Walker circulation slowdown 
to a decrease in convective mass flux under global warming that 
balances a slower increase in global mean precipitation than can be 
expected from the increase in atmospheric water vapour’. This 
muted hydrological cycle mechanism relies only on a globally uniform 
increase in SST*. Overlooked, however, is an alternative hypothesis 
that points to changes in zonal SST gradient across the tropical 
oceans!!!4, This mechanism is dominant for ENSO, but the effort 
to test its role in the long-term weakening of the Walker circula- 
tion has been hampered by uncertainty in observed SST warming 
patterns’”*. For 1950-2009, the zonal SST gradient across the Indo- 
Pacific has intensified in the HadISST1 (Hadley Centre sea ice and SST 


version 1; ref. 15) data set but shows little change in the ERSST3b 
(extended reconstructed SST version 3b; ref. 16) data set (Fig. 1). 
This disagreement is due to the change in SST measurement technique 
and different analysis methods’. 

We reconstruct the SST trend pattern for the period 1950-2009 
(Fig. la), for which various data sets are available with improved 
quality. To reduce biases caused by changes in SST measurement 
technique, we construct an SST data set using only bucket measure- 
ments from the International Comprehensive Ocean—Atmosphere 
Data Set’” (ICOADS). The resulting trend pattern exhibits a reduced 
zonal gradient between the eastern Pacific and the western Pacific/ 
southeastern Indian Ocean (Supplementary Fig. la), in contrast to 
HadISST1 and ERSST3b results (Fig. 1b, c). We also examine night- 
time marine surface air temperature (NMAT), a meteorological para- 
meter that is highly correlated with SST (Supplementary Fig. 2) but 
free of the biases introduced by changes in SST measurement technique. 
The NMAT trend pattern bears a striking resemblance to that of bucket 
SST (Supplementary Fig. 1b, c). This motivates us to estimate trends in 
merged surface temperature (MST) of the bucket SST and NMAT (see 
Methods). The MST trend features a reduced zonal gradient in the 
tropical Indo-Pacific (Fig. 1a, d), physically consistent with a flattened 
ocean thermocline observed by expendable bathythermographs*. 

To investigate the SST gradient effect, we use four atmospheric 
general circulation models (AGCMs; see Methods) and construct a 
multi-model ensemble for an unbiased estimate of the atmospheric 
response. Each AGCM is forced with four different tropical SST trend 
patterns: a spatially uniform SST increase (SUSI; ~0.5°C for 60 yr 
from 1950 to 2009) and those estimated from MST, HadISST1 and 
ERSST3b. In all the experiments including SUSI, the greenhouse-gas 
increase from the 1950s to the 2000s is imposed. The AGCM response 
to SUSI is not an accurate proxy for anthropogenic change, but serves 
as a useful reference to evaluate the SST pattern effect. To validate the 
model experiments, we use a wide variety of data sets, all of which have 
successfully captured regional patterns of tropical climate change’* 
(Methods). We also use 71 ensemble members of the ‘climate of 
the twentieth century’ experiments (20C3M) from 24 CMIP3 models 
and three atmospheric reanalyses for comparison: the US National 
Centers for Environmental Prediction (NCEP)-National Center for 
Atmospheric Research (NCAR) reanalysis’*, the European Centre 
for Medium-Range Weather Forecasting 40-year reanalysis (ERA-40; 
ref. 19), and the Twentieth Century Reanalysis version 2 (20CRv2; ref. 20). 

Over the past six decades, a weakening trend in the Walker circula- 
tion is apparent in SLP change’*” (Fig. 1a and Supplementary Fig. 3a), 
increasing over the maritime continent and decreasing east of 
the Philippines and over the central to eastern tropical Pacific. The 
MST-forced experiments reproduce these SLP change patterns quite 
well, with a slight eastward displacement of negative values over the 
eastern Pacific (Fig. 1d). In contrast, the HadISST 1-forced experiments 
strengthen the Walker circulation’ (Fig. 1b), while the ERSST3b 
experiments show no significant change (Fig. 1c). The SUSI-forced 
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Figure 1 | Observed and simulated patterns of SLP and SST changes for 
1950-2009. Changes in SLP (hPa per 60 yr; contours) and SST (°C per 60 yr; 
shading) from: a, ICOADS and MST; b, HadISST1-forced experiment; 

c, ERSST3b-forced experiment; d, MST-forced experiment; e, SUSI-forced 


experiments simulate a slight weakening of the Walker circulation 
(Fig. le) but its magnitude is much weaker than that of observations’ 
and MST-forced experiments. These AGCM experiments suggest that 
SST trend patterns are key to the Walker circulation change. 

Both the NCEP-NCAR (Fig. 1f) and ERA-40 (Supplementary 
Fig. 3b) reanalyses show an intensification of the Walker circulation 
with SLP increasing over the central equatorial Pacific and decreasing 
over the maritime continent. The SST data sets used in these reanalyses 
exhibit an intensified zonal gradient similar to that in HadISST1, 
suggesting a strong impact of SST forcing even in the NCEP-NCAR 
and ERA-40 reanalyses. The 20CRv2 reanalysis exhibits no significant 
change in SLP gradient despite the assimilation of SLP observations 
(Supplementary Fig. 3c). HadISST1 is used as a surface forcing in 
20CRv2, which we speculate acts to oppose the observed reduction 
in the Walker circulation. 

Changes in atmospheric convection are a clue to the Walker 
circulation slowdown. Observations indicate an eastward shift of 
precipitation/cloudiness from the maritime continent to the central 
tropical Pacific, accompanied by consistent changes in surface wind 
convergence (Fig. 2a, b). The MST-forced experiments simulate these 
patterns of cloud, precipitation and surface wind changes quite well, 
albeit at reduced magnitudes (Fig. 2c, d). The simulated convection 
changes closely follow spatial patterns of SST warming, consistent with 
the “warmer-get-wetter” idea, with support from model projections* 
and satellite observations for the past three decades”’. In contrast, the 
SUSI experiments simulate a precipitation/cloudiness increase over 
the maritime continent (Fig. 2e, f), in disagreement with observations. 
Within the framework of our AGCM experiments, SST warming 
patterns, rather than the hydrological cycle response to uniform 
warming, is the main driver for the Walker circulation slowdown 
and atmospheric convection changes over the past six decades. 

The close linkage between SST and the Walker circulation is readily 
illustrated in a scatter plot of changes in zonal SST and SLP gradients 
(denoted ASST and ASLP, respectively), defined as the zonal difference 
on the Equator between the eastern Pacific (150°-90° W, 5° S-5° N) 
and western Pacific/eastern Indian Ocean (90°-150° E, 5°S-5°N) 
(Fig. 3). Changes in ASST and ASLP are linearly correlated among 
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experiment; and f, NCEP-NCAR reanalysis. The contour interval (CI) for SLP 
change is indicated at the top-right corner of each panel. Positive (negative) 
contours are solid (dashed) lines, and zero contours are thickened. The basin 
averages (40° E-70° W, 20° S-20° N) of SLP change are removed. 


AGCM experiments and reanalyses (r ~ -0.97, significant at P < 0.01), 
with a strong dependence on SST data sets. MST-forced (HadISST1- 
forced) AGCMs robustly simulate a weakening (strengthening) of 
ASLP in response to the ASST weakening (strengthening), whereas 
ERSST3b-forced AGCMs show little change in ASLP. SUSI-forced 
AGCMs tend to weaken ASLP, but this is not robust among 
models and the amplitude is small. Overall, MST-forced AGCMs 
simulate the observed ASLP change more realistically than other 
SST data sets. In addition, ASLP and ASST are highly correlated 
among the 71 simulations in the CMIP3 20C3M ensemble (r ~ -0.83, 
significant at P< 0.01), although the ASLP reduction is smaller than in 
observations’. 

We estimate uncertainty in SST trends arising from observational 
biases based on 100 realizations of SST data in the third version of 
the Hadley Centre gridded SST data set (HadSST3; refs 2, 22). There 
are large variations in ASST among different realizations (Figs 3 
and 4a). We apply empirical orthogonal function (EOF) analysis 
to the 1950-2006 trend patterns over the tropical Indo-Pacific 
(30° E-60° W, 32.5° S-32.5° N) from the 100 realizations. The leading 
EOF reveals large uncertainty in warming magnitude over the equa- 
torial Indo-western Pacific, accounting for 83.6% of the total variance 
among the 100 SST realizations (Supplementary Fig. 4a). The cor- 
relation between the principal component of the leading EOF and 
the ASST trend is -0.75 (significant at P< 0.01), indicating that the 
ASST trend is strongly affected by uncertainty in the magnitude of 
SST warming over the equatorial Indo-western Pacific (Fig. 4a). 
Nevertheless, all the HadSST3 realizations show a reduction in 
ASST. AGCM experiments forced by a subset of HadSST3 realiza- 
tions, including the two extremes (no. 12 and no. 34) and median, 
all simulate the Walker circulation slowdown (Fig. 4b-e), with 
SLP increasing (decreasing) over the maritime continent (tropical 
Pacific). The magnitude of circulation change is proportional to 
that of ASST (green triangles in Fig. 3). These HadSST3 experiments 
provide strong support for our results from observations and MST- 
forced AGCMs. 

Our AGCM results show that the observed Walker circulation slow- 
down is due to the weakened zonal SST gradient across the equatorial 
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Figure 2 | Observed and simulated changes in cloudiness, surface wind and 
precipitation for 1950-2009. Changes in cloudiness (okta per 60 yr for 
observation, % per 60 yr for AGCM; shading in left panels), surface wind (ms — 
per 60 yr; vectors in left panels), and precipitation (mm month’ per 60 yr; 
shading in right panels) from: a, b, observations (ICOADS, WASWind and 


1 


Strengthening Weakening 
Ml ERA4O 
0.6 | ® 
oO 
=| 
Q 
NCEP/NCAR a 
2. 
0.3 | @Q 
= 
io} 
Oo 
5 20CRv2 
a 
& 0.0 no. 12 
— 
er 
a Amed 
~ ag 01 
B no. “@ s 
no. 34 2 
xa 
Let oc sia veces veauneceee 3. 
& 
WP HOOND ES ose ee io anebinen Mesad 
-0.6 -0.3 0.0 0.3 0.6 


ASST (°C per 60 yr) 
@ MST-forced © SUSI-forced @ HadISST1-forced @ ERSST3b-forced 


A HadSST3-forced (ECHAMS5) i Reanalysis CMIP3 20C3M 


Figure 3 | Scatter diagram of ASST and ASLP changes in AGCM and 
CMIP3 model simulations and reanalysis data sets. Confidence intervals 
(error bars) for AGCM experiments are computed from the standard deviation 
of four ensemble members, at the two-sided P = 0.01 level. The twentieth- 
century trends are estimated from each ensemble member of the CMIP3 
20C3M experiment, and scaled to 60-yr trends. Dashed lines indicate 

ASLP changes calculated from ICOADS and HadSLP2. A correlation 
coefficient between ASLP and ASST for AGCM experiments and reanalyses 
(CMIP3) is -0.97 (-0.83), significant at P< 0.01. See key at bottom for 
meaning of symbols. 


rain-gauges); c, d, MST-forced experiment; and e, f, SUSI-forced experiment. 
Only significant changes exceeding the 95% confidence level are shaded in 
colour. Stippling in c, d, indicates regions of the MST warming trend above the 
tropical mean. 


Indo-Pacific, rather than to the muted hydrological cycle under uni- 
form warming”®. This SST pattern mechanism itself is not new'"*"*”° 
but its dominance in tropical climate change over the past six decades 
is surprising, given the prevailing assumption that the Walker circula- 
tion slowdown occurs without the zonal SST gradient change. 
Conversely, the reduced zonal SST gradient is the response to the 
weakened easterly trades®. The circular results suggest a positive feed- 
back between the Walker circulation and SST gradient. In light of 
modest changes in the 20C3M simulations (Fig. 3), the observed 
changes discussed here probably include the influence of natural 
variability superimposed on externally forced response”**. The major- 
ity of CMIP phase 5 (CMIP5) models project a weakened SST gra- 
dient across the equatorial Indo-Pacific in response to increased 
greenhouse-gas forcing, but its magnitude is not significant in the 
twentieth century (Supplementary Fig. 5). On the other hand, the 
increased aerosols due to economic development and biomass burning 
may also have contributed to the reduced SST warming over the Indo- 
western Pacific region*’. An open question still remains: to what degree 
does anthropogenic forcing contribute to the observed changes? A 
conclusive attribution will have to wait for longer observations and 
the growth of the forced response. 

Research into future climate change relies heavily on numerical 
models, but model skill in simulating long-term changes is largely 
untested. Validating climate models against observed change is critical 
in building confidence in their future projections. The slowdown of the 
Walker circulation is a robust change observed for the past century and 
serves a useful testbed for model evaluation. Our results show that 
AGCMs are successful in reproducing the Walker circulation slow- 
down, provided that the correct pattern of SST warming is prescribed. 
Taken together, our SST analysis and AGCM experiments indicate a 
weakening of zonal SST gradient across the equatorial Indo-Pacific 
over the past six decades. Our study highlights uncertainties in the 
tropical Indo-Pacific warming pattern and their effect on the Walker 
circulation change. The effect of SST warming uncertainty may extend 
remotely to densely populated regions such as East Asia through the 
atmospheric bridge** (Supplementary Fig. 4b). Our analysis suggests 
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Figure 4 | Uncertainty in HadSST3 trends. a, Scatter diagram of ASST 
change and the principal component of the leading EOF for the tropical Indo- 
Pacific SST trend patterns. The median and extreme realizations no. 01, no. 12, 
no. 34 of HadSST3 are indicated in red circles. Green circle indicates the 
ensemble mean of the HadSST3 realizations with the confidence interval at the 


that atmospheric reanalyses are also subject to errors of SST warming 
pattern. Their judicial use is advised in studying long-term changes. 
Reducing biases and uncertainty in regional patterns of SST warming 
is a key step for improving the reanalyses and interpretation of long- 
term climate change. 


METHODS SUMMARY 

The MST trends were calculated for 1950-2009 as an unweighted average of the 
bucket SST and NMAT trends from ICOADS release 2.5 (ref. 17) and the Met Office 
Historical Marine Air Temperature 43N (MOHMAT; ref. 15). We performed a set of 
two AGCM experiments, in which the 30-yr-scaled MST trend pattern was either 
added to or subtracted from the HadISST1 climatology over the tropics only 
(20° S-20° N). These two experiments were also forced with greenhouse-gas con- 
centrations averaged in the 2000s and 1950s, respectively. By taking the difference 
between the two experiments, we extracted atmospheric changes over the past 60 yr. 
To investigate the effect of SST warming patterns, the same sets of AGCM experi- 
ments were forced by HadISST1, ERSST3b and HadSST3 trend patterns and SUSI 
(~0.5 °C increase per 60 yr). To obtain an unbiased estimate of the atmospheric 
response, we used four AGCMs: the US National Oceanic and Atmospheric 
Administration (NOAA) Geophysical Fluid Dynamics Laboratory (GFDL) 
AM2.1 (ref. 27), the Max Planck Institute (MPI) ECHAMS (ref. 28), the NCAR 
Community Atmospheric Model (CAM) version 3 (ref. 29) and 4 (ref. 30). The 
AM2.1 (CAMA) uses a finite-volume grid of 2.5° x 2° (2.5° X 1.9°) longitude- 
latitude and 24 (26) vertical levels, and the ECHAM5 (CAM3) uses a T42 grid and 
19 (26) vertical levels. For each experiment the models were integrated for 41 yr with 
the first year of integration discarded as a spin-up. Further details of observational 
data sets, AGCM experiments and analysis methods are discussed in Methods. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

SLP, SST and NMAT. For SLP, we used the ICOADS release 2.5 (ref. 17) and the 
Hadley Centre’s mean SLP data set version 2 (HadSLP2; ref. 31). For SST, we used 
HadISST1 (ref. 15), ERSST3b (ref. 16), HadSST3 (refs 2, 22) and bucket-sampled 
SSTs from ICOADS. For surface air temperature, we analysed only night-time data 
because daytime observations are strongly biased by heating effects over ship 
decks. For NMAT, ICOADS and the Met Office Historical Marine Air 
Temperature (MOHMAT) 43N (ref. 15) were used. All ICOADS data have been 
trimmed using the climatological 4.5 standard-deviation limit that identifies 
potential outliers and then gridded on a 4° grid with an unweighted box average’. 
Missing data in ICOADS were filled with linear interpolation across ‘no data’ grids 
not exceeding 5 points in longitude and 3 points in latitude. This interpolation was 
applied twice to obtain basin-scale characteristics. For the MST trend patterns, we 
first calculated trends in bucket SST and NMAT separately, and then constructed 
an unweighted average of their trend patterns. To improve the readability, 
the MST trend patterns were spatially smoothed with a 5 X 5 point Gaussian 
filter. ICOADS (http://rda.ucar.edu/datasets/ds540.0/), HadSLP2 (http://www. 
metoffice.gov.uk/hadobs/hadslp2/), HadISST1  (http://www.metoffice.gov.uk/ 
hadobs/hadisst/), ERSST3b (http://www.ncdc.noaa.gov/ersst/), HadSST3 (http:// 
www.metoffice.gov.uk/hadobs/hadsst3/), MOHMAT43N (http://www.metoffice. 
gov.uk/hadobs/mohmat/) are all available online. 

Surface wind. The Wave- and Anemometer-based Sea-surface Wind (WASWind) 
data set (http://iprc.soest.hawaii.edu/users/tokinaga/waswind.html) was used. 
This data set substantially reduces spurious upward trends in the ICOADS raw 
winds by applying bias corrections with a stability-dependent height correction for 
measured winds and using wind-wave height. The obtained trend patterns are in 
good agreement with SLP measurements since the 1950s and satellite measure- 
ments for recent decades, illustrating the utility for climate change studies. 
Precipitation and cloudiness. We used four rain-gauge based monthly mean 
gridded products available at NOAA® (http://www.esrl.noaa.gov/psd/data/ 
gridded/data.precl.html), the Global Precipitation Climatology Centre** (ftp:// 
ftp.dwd.de/pub/data/gpcc/html/fulldata_v6_doi_download.html), the University 
of Delaware*’ (http://climate.geog.udel.edu/~climate/html_pages/Global2_Ts 
_2009/Global_p_ts_2009.html), and the University of East Anglia Climate 
Research Unit*® (http://www.cru.uea.ac.uk/cru/data/precip/). All data were 
regridded onto a 3.75° X 2.5° longitude-latitude grid, and precipitation trends 
were calculated for 1950-2007 from an unweighted average of the four products. 
Marine cloudiness trends were calculated for 1950-2009 based on ICOADS. To 
eliminate the bias in ship-observed marine cloudiness, we removed the tropical 
(30° N-30° S) mean trend from each oceanic grid box". 

Estimate of observed trends. All trends were calculated using only well-sampled 
grid boxes that contain monthly means for more than 75% of the total months. To 
suppress interannual variability for each month, all yearly data were smoothed 
with a five-point binominal temporal filter before the trend analysis. We estimated 
long-term trends and their statistical significance with the Sen median slope” and 
the Mann-Kendall** test, respectively, non-parametric methods less affected by 
outliers. The analysis period is basically from 1950 to 2009, but some data sets are 
not available for the entire period. In such cases, we scaled the trends to the 60-yr 
changes. Annual mean trends were presented in all figures. 

AGCM experiments. We used the NOAA GFDL AM2.1 (ref. 27), the MPI 
ECHAMS (ref. 28), the NCAR CAM3 (ref. 29) and CAM4 (ref. 30). The AM2.1 


(CAMA) uses the finite-volume grid of 2.5° X 2° (2.5° X 1.9°) longitude-latitude 
and 24 (26) vertical levels, and the ECHAMS5 (CAM3) uses a T42 grid and 19 (26) 
vertical levels. For each model and SST data set, we performed a set of two 
experiments, in which the 30-yr-scaled SST trend pattern was either added to or 
subtracted from the SST climatology over the tropics only (20° S-20° N). Two SST 
forcing fields were derived as follows: 


SSTwam = SST + 30SST’ 


SSTeoig = SST — 30SST’ 


where SST is the monthly SST climatology obtained from HadISST1, and SST’ is 
the monthly mean SST trends per year estimated from each SST data set. Both SST 
and SST’ were calculated for 1950-2009. For the SUSI experiment, we used a 
tropical mean SST trend of 0.0083 °C increase per year (~0.5 °C per 60 yr) based 
on the HadSST3 estimate. All SST forcing is yearly cycling data so that does not 
include interannual variability. The SSTwarm and SST og experiments were also 
forced with greenhouse-gas concentrations (CO, CHy, NxO, CFC11, CFC12 and 
O3) averaged in the 2000s and 1950s, respectively. By taking the difference between 
the two experiments, we extracted atmospheric changes over the past 60 yr. For 
each experiment the models were integrated for 41 yr with the first year of integ- 
ration discarded as a spin-up. We used only ECHAMS for HadSST3-forced 
experiments. The statistical significance for the difference between SSTwarm and 
SST pola experiments was estimated with the two-sided Student's t-test. 

Other data sets. We used three atmospheric reanalysis products: the NCEP- 
NCAR reanalysis’* for 1950-2009 (http://rda.ucar.edu/pub/reanalysis/index.html), 
the ERA-40 reanalysis!’ for 1958-2001 (http://www.ecmwf.int/products/data/ 
archive/descriptions/e4/index.html), and 20CRv2 (ref. 20) for 1950-2008 (http:// 
www.esrl.noaa.gov/psd/data/gridded/data.20thC_ReanV2.html). We also used the 
CMIP3 ‘Climate of the twentieth century experiments (20C3M)’ (24 models) for 
1900-1999, the CMIP5 historical run (22 models) for 1900-2005, and the CMIP5 
high emission scenario (RCP8.5; 22 models) and medium mitigation scenario 
(RCP4.5; 22 models) for 2006-2098. 
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The global diversity of birds in space and time 


W. Jetz!*, G. H. Thomas”*, J. B. Joy**, K. Hartmann‘ & A. O. Mooers? 


Current global patterns of biodiversity result from processes that 
operate over both space and time and thus require an integrated 
macroecological and macroevolutionary perspective’ *. Molecular 
time trees have advanced our understanding of the tempo and 
mode of diversification*”’ and have identified remarkable adaptive 
radiations across the tree of life**°. However, incomplete joint 
phylogenetic and geographic sampling has limited broad-scale 
inference. Thus, the relative prevalence of rapid radiations and 
the importance of their geographic settings in shaping global bio- 
diversity patterns remain unclear. Here we present, analyse and 
map the first complete dated phylogeny of all 9,993 extant species 
of birds, a widely studied group showing many unique adaptations. 
We find that birds have undergone a strong increase in diversifica- 
tion rate from about 50 million years ago to the near present. This 
acceleration is due to a number of significant rate increases, both 
within songbirds and within other young and mostly temperate 
radiations including the waterfowl, gulls and woodpeckers. 
Importantly, species characterized with very high past diversifica- 
tion rates are interspersed throughout the avian tree and across 
geographic space. Geographically, the major differences in diver- 
sification rates are hemispheric rather than latitudinal, with bird 
assemblages in Asia, North America and southern South America 
containing a disproportionate number of species from recent rapid 
radiations. The contribution of rapidly radiating lineages to both 
temporal diversification dynamics and spatial distributions of spe- 
cies diversity illustrates the benefits of an inclusive geographical 
and taxonomical perspective. Overall, whereas constituent clades 
may exhibit slowdowns’*"’, the adaptive zone into which modern 
birds have diversified since the Cretaceous may still offer oppor- 
tunities for diversification. 

Birds (class Aves) constitute a fascinating and widely-studied radia- 
tion. Analyses based on a very incomplete ‘tapestry’ phylogeny”” sug- 
gested higher speciation and diversification rates in the tropics and in 
South relative to North America®. In addition, numerous geographi- 
cally disparate clades are considered exceptional radiations, including 
both New and Old World warblers in the Northern Hemisphere’®"®, 
island radiations such as Darwin’s finches”’, and the explosively diver- 
sifying white-eyes that span much of the southern Old World’*. The 
prevalence and implications of such rapidly radiating clades have not 
been put in broader context: how characteristic are shifts in diversifica- 
tion, and to what extent do clade-specific and tree-wide variation in 
diversification rate contribute to diversity dynamics across the extant 
tree of the entire class? Where do these radiating lineages occur, and 
how much do they contribute to current-day patterns of diversity in 
the highly diverse tropics compared to relatively depauperate higher 
latitudes? 

We address these questions using the first set of complete phylo- 
genies of extant bird species (9,993 species, see Methods), compiled in 
a Bayesian framework, and a new species-level measure of past diver- 
sification rate. The phylogeny builds on previously established deeper 
relationships and combines molecular data for 6,663 species with 
taxonomic constraints for data-deficient species to more fully account 


for phylogenetic uncertainty. Lineages-through-time® and novel diver- 
sification-rates-through-time plots (Fig. 1) indicate that net diversifi- 
cation leading to extant lineages generally increased from approximately 
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Figure 1 | Diversification of all birds through time. a, b, Estimates of the 
tree-wide lineage net diversification (speciation - extinction) rate (a) and 
speciation rate (b), calculated in 5 million year intervals (line segments). These 
are estimated to be very similar (see Supplementary Discussion). The shaded 
region represents the area between the 5th and 95th quantiles for 525 assessed 
trees with the mean rate traced in black. Intervals outside 67.5 and 2.5 Myr ago 
are not shown due to lack of data (=30 lineages per interval) and the difficulty 
of accounting for ongoing speciation events, respectively. c, Lineage-through 
time plot for 1,000 trees (in grey), with mean waiting times to speciation in 
black. Green background is the tree depicted in Fig. 2. Geologic time periods are 
delineated at the bottom of the plot. Ju, Jurassic period; Qu, Quaternary period. 
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50 million years (Myr) ago to at least 5 Myr ago. This contrasts markedly 
with patterns of decreasing net diversification reported for smaller clades 
for which a model of niche-filling coupled with long-term lineage com- 
petition’’ has been invoked”"®”””, 

By far the best model for diversification (median Akaike 
Information Criterion improvement over all other models >>1,700; 
Supplementary Discussion and Table 1) is one where a subset of indi- 
vidual clades (identified in Fig. 2 and Table 1) are assigned their own 
constant and elevated diversification rates. This confirms a space- and 
time-dependent history of avian diversification, with most of the con- 
sistently identified rate increases occurring within the last ~50 Myr. 
Several of these shifts confirm previously identified highly diverse 
clades with key morphological and behavioural innovations or envi- 
ronmental opportunities, including hummingbirds, parrots and a 
number of songbird lineages. Some additional young and rapidly 
radiating groups with rates of per-lineage species accumulation = 0.25 
species per million years (Table 1) stand out. These include the pre- 
viously unrecognized rapid radiations of both ducks and geese, and of 
select gulls, as well as recognized rapid radiations of ovenbirds plus 
woodcreepers”’, and of white-eyes"*, all with extant diversity ranging 
from 44 to 300 species. Intriguingly, these recently rapidly radiating 
clades show no obvious restriction to parts of the phylogenetic tree 
(Fig. 2) or landmasses. Rapidly and slowly radiating clades are highly 
interspersed. Collectively, these tree-wide distributed bursts produce 
the overall average increase in tree-wide diversification rate (Fig. la). 
Importantly, these novel patterns assume rate constancy within clades, 
and so integrate (rather than conflict with) changing rates associated 
with, for example, diversity- or time-dependent slowdowns at smaller 
scales within individual clades”. 

To understand the consequences of rate variation across the tree for 
the present-day geographic variation in avian diversity in more detail, 
we calculate a new metric of species-level lineage diversification rate 
(DR) for every species. The measure captures the longer-term splitting 
rate leading to a given species, made possible by the full resolution of 
the phylogeny. It is directly related to clade level diversification rate 
measures (see Supplementary Methods), but offers species-level reso- 
lution. Mean species values range from 0.01 to 4.66 species Myr‘ and 
roughly follow a lognormal distribution (Fig. 2, centre) with a geome- 
tric mean of 0.157 species Myr‘ and only limited variation owing to 


Table 1 | Rate shifts found in at least 25% of sampled trees. 
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captured phylogenetic uncertainty (Supplementary Discussion Fig. 4). 
The species level detail illustrates the marked heterogeneity in diver- 
sification rates beyond the main shifts and identifies numerous ‘hot’ 
sections of recent rapid radiations dispersed widely across the avian 
tree of life. 

This heterogeneity extends to the geographical prevalence of species 
with a signature of high past diversification. We find that the main 
geographic differences in diversification rate are east-west hemi- 
spheric, rather than latitudinal’*’* (see below). Average rates are dis- 
tinctly lower in the Eastern (east of 28° W, DR= 0.145, N= 5,810) 
compared to the Western Hemisphere (DR=0.177, N= 4,183, 
Payg < 0.05, f(P < 0.05) = 73/100, Fig. 3). In contrast, there are no 
significant difference in rates (P = 0.69, f(P < 0.05) = 0/100) between 
Northern (DR = 0.161, N = 5,084) and Southern Hemisphere species 
(DR = 0.154, N = 4,909). Intriguingly, avian assemblages in Australia, 
Southeast Asia, Africa and Madagascar are characterized by particu- 
larly low average rates (below approximately 0.12 species Myr ‘ in 
most locations, compared to a global mean of 0.16). These regions also 
harbour substantially fewer than expected of the 25% of species with 
highest diversification rate (<15% in most assemblages, Fig. 3d). One 
hypothesis for the patterns in Australia and Africa is regional density- 
dependent diversification, with early filling of ecological space by 
ancient radiations in their regions of origin’*. Generally, high diver- 
sification rates and large relative prevalence of top diversification rate 
species are found throughout higher-latitude North America, parts of 
north Asia and southwest South America—the main breeding areas of 
several of the rapidly radiating clades identified in Table 1 (including 
warblers, ducks, gulls and woodpeckers). These regions have all been 
characterized by strong climatic fluctuations from the Pliocene to the 
present (with, for example, notable emergence and size fluctuations of 
wetland and forest breeding habitats), which supports the suggestion 
that the geography of past climate dynamics has had a major role for 
today’s diversity patterns’*. Isolated locations such as islands have 
been proposed to strongly facilitate rapid radiations through the 
reduction in gene flow and novel ecological opportunities they often 
present after initial colonization’”**. We confirm this globally, as non- 
pelagic birds with more than half of their range on islands (N = 1,085, 
DR = 0.203) have much larger diversification rates than predomi- 
nantly mainland birds (N= 8,629, DR=0.153; Piyg < 0.001, 


Node English Scientific Age r Species 
G Select gulls Select Laridae 4.6 (0.0) 0.74 44 

U White-eyes, select babblers Zosteropidae, select Timaliidae 8.1 (0.1) 0.49 113 

N Ovenbirds, woodcreepers Dendrocolaptidae, Furnariidae 17.8 (0.1) 0.28 285 

[0] - Ovenbirds - Furnariidae 13.6 (0.1) 0.31 223 
Cc Select ducks, geese Anseriformes 10.8 (0.1) 0.28 109 
i Babblers, white-eyes Timaliidae, Zosteropidae 17.7 (0.1) 0.27 355 
Y Weavers, estrildid finches Ploceidae, Estrildidae 21.9 (0.2) 0.24 278 

H Woodpeckers and allies Ramphastidae, Indicatoridae, Picidae 36.4 (0.3) 0.17 361 

1] - Woodpeckers - Picidae 23.8 (0.2) 0.21 220 
Xx Passeroidea songbirds Passeroidea 35.7 (0.1) 0.20 1,418 

D Select hummingbirds Select Trochilidae 20.7 (0.1) 0.19 266 

R Select sylvioid songbirds Select Sylvioidea 33.3 (0.1) 0.17 1,031 

S$] - Subset - Subset 29.1 (0.1) 0.19 704 
Q Passerida songbirds Passerida 44.7 (0.1) 0.16 3,574 

New World suboscines Tyrannides 52.4 (0.2) 0.16 1,227 
Select Corvida songbirds Select Corvida 32.1 (0.2) 0.16 442 

F Gulls, terns, auks, skuas, jaegers Stercorariidae, Alcidae, Laridae 21.1 (0.1) 0.15 131 
V Muscicapoidea songbirds Muscicapoidea 38.8 (0.1) 0.14 732 

Ww] - subset - Subset 29.4 (0.2) 0.15 603 
J Parrots, songbirds Psittaciformes, Passeriformes 77.2 (0.2) 0.14 6,320 

L] - Most songbirds - Passeriformes 66.8 (0.2) 0.14 5,964 
A Fowl Galloanserae 78.7 (0.2) 0.08 448 

B] - Landfowl - Galliformes 53.9 (0.3) 0.13 267 

K True parrots Psittacidae 35.6 (0.3) 0.13 330 

E Pigeons, doves Columbidae 33.4 (0.1) 0.12 289 

Rate shifts were identified using MEDUSA? (see Fig. 2 for position and prevalence). Pairs of nested shifts are combined when younger shifts (in square brackets) were depressed by the presence of older to <10% 
occurrence. ‘Node’ refers to position on example tree (Fig. 2), Age’ of node is the mean age in units million years (with s.e. calculated across trees) calculated across the posterior sample of trees and ‘r' is the mean 
per lineage diversification rate (in units of Myr” !,s.e. across trees <0.01 for all) estimated for focal nodes of given species richness (‘Species’). All clade-level rate estimates were inferred using Laser?®. The 
background rate as estimated with MEDUSA was r = 0.056 Myr !. Support for nodes was 1.00 for all except U (0.02), T (0.73), S (0.48), P (0.85) and E (0.93). 
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Figure 2 | Diversification across the avian tree. Diversification rate shifts 
identified by MEDUSA and the species level diversification rate metric (DR) are 
displayed on a representative avian tree. Nodes with shifts in diversification rate 
identified in at least 25% of the tested trees are indicated by pie-charts (labelled 
A-Y, see Table 1). Black and grey areas show the proportion of trees with a shift 
at the focal node and with shifts that are nested within (more recent than) the 
focal node, respectively. Shifts are counted only once (for example, shifts at 
node U do not contribute to the prevalence of nested shifts at node T). 
Prevalence of shifts may be lowered by a ‘trickle-down’ effect where 
combinations of nested shifts are rarely identified in the same tree (see also 


f(P < 0.05) = 100/100). Many islands, particularly those inhabited by 
passerines in the Pacific, North Atlantic and Caribbean, stand out for 
their high diversification rates, for example, as shown by the dramatic 
white-eye radiation (Table 1). As might be expected on the basis of 
species richness alone, passerines (N = 5,966, DR = 0.179) have on 
average much higher diversification rate values than non-passerines 
(N = 4,027, DR = 0.131), a difference captured by the diversification 
rate increase observed at the base of the Passeriformes (node ‘L’ in 
Table 1, Fig. 2). Accordingly, passerines have a strong influence on the 
geographic patterns of highest diversification rate (Fig. 3b, c). How- 
ever, both major groups harbour high and low diversification rate 
species and both have major hemispheric differences as well as higher 
rates on islands. Intriguingly though, non-passerines, especially 
through groups such as water birds, and some gulls and woodpeckers, 
drive the higher rates in Asia and also show high rates in North 
America. Passerines show significant recent radiations in temperate 
Asia and North America. In particular due to the ovenbird and wood- 
creeper clades, passerines contribute strongly to high average diver- 
sification rates and present richness in South America, especially south 
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Supplementary Discussion Fig. 6 for discussion). Branches are coloured 
according to the mean diversification rate of descendant branches. We colour 
branches for visualization purposes to highlight tree-wide variation in 
diversification rate and do not analyse values for internal branches. 
Diversification rate quantifies the splitting rate along branches leading to a 
species and offers species-level detail for clade-level diversification rate (see 
Supplementary Methods). The inset shows the scale and frequency distribution 
of diversification rate values across species. Concentric grey circles show time 
from the present in 20 million year intervals. 


of the core rainforest belt. We propose that over the past 10-20 Myr the 
expansion of core temperate habitats* as well as mountain uplifts and, 
more recently, the retreat of high-latitude glaciations (followed by 
range expansions and speciation), all contributed to these distinctive 
patterns. 

These geographic patterns add an important dimension to perspec- 
tives that have emphasized latitude’ as a predictor of both diversifi- 
cation rate and standing diversity. In a highly simplifying global 
latitudinal view (Fig. 4), high diversification rate species in all tropical 
lowland regions seem generally outnumbered by species of lower 
diversification rate. Overall, the simple hypothesis that latitudinal 
diversity differences result from raw variation in net rates by latitude 
is rejected: mean diversification rate shows no significant trend with 
absolute latitude or between the tropics and higher latitudes (Fig. 4). 
This finding is consistent with the hypothesis that the greater com- 
bined age and expanse of tropical moist forests has facilitated greater 
species accumulation without necessarily facilitating greater rates of 
diversification*. The equatorial dip in average diversification rates is 
followed by a peak in the slightly drier and (over the past 10 Myr) 
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Figure 3 | Geographic variation in species-level lineage diversification rate 
and the richness of high-diversification rate species. a—c, Mean assemblage 
diversification rate (see Fig. 2), calculated as the geometric mean of all species in 
a grid cell assemblage, weighted by the inverse of their range size. a, All species; 
b, non-passerines; c, passerines. This visualization limits the overbearing 


climatically more dynamic regions at around 10-15° in both hemi- 
spheres and an overall decrease towards the high latitudes. However, as 
our spatial results indicate, the outcome of any such latitudinal com- 
parisons, excepting perhaps a narrow equatorial band, will depend on 
hemisphere and region, and the inclusion, prevalence and current 
evolutionary dynamics of particular clades (see also ref. 3). 

Our choice of a constant-rate birth model as a prior on diversifi- 
cation is conservative with respect to clade-level rate heterogeneity, 
increasing diversification towards the present and major hemispheric 
differences in lineage-diversification rate. Our results are also robust to 
differing phylogenetic hypotheses, including currently existing 
alternative backbone topologies and the inclusion of data-deficient 
species (see Supplementary Discussion). Indeed, our tree distribution 
was designed to both integrate the latest taxonomic data and to capture 
remaining phylogenetic uncertainty given current knowledge; this 
means it will not be suitable for some questions in avian systematics. 
In addition, the age of origination of crown group birds and the abso- 
lute diversification rates will be contingent on the specific fossil con- 
straints used. Ongoing discovery of cryptic species may further affect 
rate estimates, especially in the tropics where taxonomists have been 
less active. Further, and importantly, our inference is necessarily li- 
mited to lineages leading to present-day species and cannot account 
for the non-random extinction of entire clades (for example, of the 
entire moa clade in New Zealand). Finally, the illustrated spatial pat- 
terns tie diversification rate to the occurrence of species today rather 
than the time of their origination when the distribution of bioclimatic 
zones and species may have been different. Nonetheless, the uncovered 
geographic heterogeneity in diversification rates seems pervasive and 
provides a first global integration of species-level diversification rate 
variation across both time and space. It may be that birds are distinc- 
tive in their pattern of an increasing diversification rate from ~50 Myr 
ago to the recent: the overall adaptive zone into which modern birds 
have diversified since at least the early Eocene may not be saturated, 
and opportunities for diversification may be expanding. Alternatively, 
if the evidence for saturation in other taxa is not due to sampling 


(pseudo-replicating) effect wide-ranging species have on perceived spatial 
patterns of assemblage summaries*. d-f, Relative (d) and absolute (e) richness 
of top 25% diversification rate species (DR = 0.243 species Myr’ '); f shows the 
richness of all 9,993 bird species for comparison. Grid cell size is 110 X 110 km 
for all panels (Behrman projection). 
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Figure 4 | Latitudinal gradient in species-level lineage diversification rate. 
Each black point represents a single species diversification rate (DR) at the 
centroid latitude of its global breeding range. Only the 75% species with small to 
moderate latitudinal extent (<26.4°, N = 7,493) are included to ensure 
comparable centroid positions (see Supplementary Discussion Fig. 7 for very 
similar patterns and results including all species). There is no significant 
association between diversification rate and absolute centroid latitude 

(Payg = 0.51, f(P < 0.05) = 0/100) or for intra- (<23° latitude) against extra- 
tropical centroid location (Payg = 0.16, f(P < 0.05) = 31/100). The solid line is a 
loess smooth over all data (span = 0.2, degree = 2). The dashed line indicates 
the threshold identifying the quartile of species with highest diversification rate 
(DR = 0.243 Myr‘). Darker brown shading highlights greater density of 
species points. 
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issues”®, then the difference may be due to the novel combined scale 
and resolution of our analyses, and smaller clades that are geogra- 
phically or ecologically bounded may indeed saturate"*’. We will need 
to compare global complete trees such as the one we present here with 
well-characterized component clades. We predict that constraint- 
induced slowdowns in net diversification will appear at smaller eco- 
logical and geographic scales, but that these do not combine to produce 
the same pattern at broader scales. This would be consistent with a 
classical view of multiple and ongoing, individually constrained adap- 
tive radiations as a main driver of biodiversification”’. 


METHODS SUMMARY 
Phylogenetic tree construction. Pseudo-posterior samples of complete avian trees 
were assembled as follows. (1) Every bird species was assigned to one of 158 clades 
identified using a backbone phylogeny”. (2) Relaxed-clock trees were generated for 
each clade from sequence data. (3) Relaxed-clock trees for entire clades were gen- 
erated combining species with and without genetic data: species without genetic 
information (3,330) were placed within their clade using constraint structures 
consistent with consensus trees from step (2) plus taxonomic information and 
branching times sampled from a pure birth model of diversification. (4) Final trees 
were assembled from the clade distributions plus samples of dated backbone trees 
from (one of two) distributions constructed using relaxed molecular clock tech- 
niques, 15 genes, ten fossil constraints and extensive topology constraints derived 
from published sources. For further information see Supplementary Methods, for 
trees, see Supplementary Results or http://birdtree.org. 
Diversification analyses. We compared diversification models on 525 sampled 
trees: seven models fit smooth changes in rate**, one searches for tree-wide rate 
shifts and one identifies clades that have significantly elevated or depressed 
diversification rates relative to the entire tree*. To visualize diversification through 
time we estimated tree-wide speciation and extinction rates in five-million-year 
intervals using TreePar”’. 

We measured the species-level lineage diversification rate (DR) for every species 
i as the inverse of its mean equal splits measure’’: 


where N; = number of edges on path from species i to the root and J; = length of 
the edge j. 

Comparative diversification rate analyses. For comparisons of diversification 
rate we used phylogenetic generalized least squares across 100 trees and list average 
P values and frequency (f) of trees with P < 0.05. For the island—mainland com- 
parison we excluded 279 predominantly pelagic species. 

Spatial analysis. We compiled non-invasive breeding distributions from the lite- 
rature and select updates (see Supplementary Methods). We extracted these ranges 
over a 110 km equal area grid in Behrman projection excluding cells with <30% 
dry land or without off-shore islands, resulting in 2.43 x 10° occurrence records 
over 12,850 cells. 
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An ultraviolet-radiation-independent 
pathway to melanoma carcinogenesis in the 
red hair/fair skin background 
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Martin C. Mihm‘, Jennifer A. Wargo°, Kathleen C. Robinson!, Suprabha P. Devi", Jillian C. Vanover’, John A. D’Orazio’, 
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People with pale skin, red hair, freckles and an inability to tan—the 
‘red hair/fair skin’ phenotype—are at highest risk of developing 
melanoma, compared to all other pigmentation types’. Genetically, 
this phenotype is frequently the product of inactivating poly- 
morphisms in the melanocortin 1 receptor (MCI1R) gene. MCIR 
encodes a cyclic AMP-stimulating G-protein-coupled receptor 
that controls pigment production. Minimal receptor activity, as in 
red hair/fair skin polymorphisms, produces the red/yellow pheome- 
lanin pigment, whereas increasing MC1R activity stimulates the pro- 
duction of black/brown eumelanin’. Pheomelanin has weak shielding 
capacity against ultraviolet radiation relative to eumelanin, and 
has been shown to amplify ultraviolet-A-induced reactive oxygen 
species* °. Several observations, however, complicate the assumption 
that melanoma risk is completely ultraviolet-radiation-dependent. 
For example, unlike non-melanoma skin cancers, melanoma is not 
restricted to sun-exposed skin and ultraviolet radiation signature 
mutations are infrequently oncogenic drivers’. Although linkage 
of melanoma risk to ultraviolet radiation exposure is beyond doubt, 
ultraviolet-radiation-independent events are likely to have a signifi- 
cant role’’. Here we introduce a conditional, melanocyte-targeted 
allele of the most common melanoma oncoprotein, BRAFY =, into 
mice carrying an inactivating mutation in the McIrgene (these mice 
have a phenotype analogous to red hair/fair skin humans). We 
observed a high incidence of invasive melanomas without providing 
additional gene aberrations or ultraviolet radiation exposure. To 
investigate the mechanism of ultraviolet-radiation-independent 
carcinogenesis, we introduced an albino allele, which ablates all 
pigment production on the McIr“ background. Selective absence 
of pheomelanin synthesis was protective against melanoma develop- 
ment. In addition, normal MciIr” mouse skin was found to have 
significantly greater oxidative DNA and lipid damage than albino- 
Mcir“ mouse skin. These data suggest that the pheomelanin pigment 
pathway produces ultraviolet-radiation-independent carcinogenic 
contributions to melanomagenesis by a mechanism of oxidative 
damage. Although protection from ultraviolet radiation remains 
important, additional strategies may be required for optimal mela- 
noma prevention. 

To study the role of pigmentation in BRAF melanoma develop- 
ment, we used a series of genetically matched mice on the C57BL/6 
background with various pigmentation phenotypes (Fig. 1a). To mimic 
dark-skinned individuals with a high eumelanin-to-pheomelanin 
ratio, we used mice with the wild-type C57BL/6 pigmentation pheno- 
type (‘black’). To mimic individuals with the red hair/fair skin pheno- 
type who carry a high pheomelanin-to-eumelanin ratio, we used mice 


‘V600E 


with premature termination of the McIr transcript (McIr“’, ‘red’)®. To 
mimic individuals with albinism who have no melanin, we used mice 
with an inactivating mutation at the tyrosinase locus (Tyr, ‘albino’)’. 
Because tyrosinase is the initial and rate-limiting enzyme in melanin 
synthesis, albino melanocytes do not produce any pigment, but are 
normal in number and viability’’. 

We generated two variants of each pigmentation phenotype. One 
variant contains melanocytes in the dermis. A second matched variant 
contains transgenic stem cell factor expressed under the keratin 14 
promoter (K14-SCF), which mimics SCF expression in human epi- 
dermal keratinocytes and results in epidermal melanocyte localization". 

To create a genetic context primed for the induction of melanoma 
we also introduced into each of our six variants a system for inducible, 
melanocyte-specific expression of oncogenic BRAFY®™ (ref. 12). In 
humans, mice and zebrafish, expression of BRAFY O° in melanocytes 
primarily causes benign nevi, rather than melanoma’*”’. In this con- 
text, malignant melanoma progression is thought to be constrained by 
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Figure 1 | Without ultraviolet radiation, Braf“ red mice have an increased 
rate of melanoma development relative to black and albino Braf animals. 
a, C57BL/6 pigmentation variants with epidermal melanocytes (K14-SCF). 
From left to right: black (wild type), red (Mclr’”’) and albino (Tyr). 

b, Genotype of animals used for experimental studies. c, Percentage survival of 
pigmentation variants not carrying the K14-SCF transgene, that is, no epidermal 
melanocytes (Mpjack = 28, Mrea = 40; Natbino = 48)- Ppiack-albino = 0.250, 
Pplack-red = 0.003, Patbino-rea = 0.003. d, Percentage survival of pigmentation 
variants carrying the K14—SCF transgene, that is, epidermal melanocytes 
(Mplack = 49, Nred = 77; Nalbino = 41). Potack-albino = 9-103, Pyiack-rea = 0.009, 
P.tbino-red < 0.0001. 
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oncogene-induced senescence’. Consistent with this, expression of 
BRAFY®”” in conjunction with silencing of PTEN, TP53 or CDKN2A 
leads to development of malignant melanoma’**. However, spon- 
taneous progression of BRAFY”?-expressing melanocytes to malig- 
nant melanoma has been reported following a long latency period in 
C57BL/6 mice, although this phenomenon was not seen on an outbred 
mode 12,16,17° 

We initially produced six groups of BRAF’ °°” inducible (‘Braf“”) 
mice representing three pigmentation variants (‘black’, ‘red’ and 
‘albino’) with or without epidermal melanocytes (+/— K14-SCF, 
Fig. 1b). Melanocyte-selective expression of BRAFY°°*” was achieved 
by tamoxifen-mediated activation of Cre recombinase (Tyr-Cre(ER) 2) 
in adult mice carrying the Braf“ allele. The animals were then followed 
without environmental genotoxic stressors (such as ultraviolet radi- 
ation). Black and albino Braf“* mice developed similarly low rates of 
melanoma after a long latency (regardless of K14-SCF status). In con- 
trast, red Braf~“ mice developed melanomas at an accelerated rate 
with >50% having tumours after 1 year, regardless of K14-SCF status 
(Fig. 1c, d). 

The tumours on the black, red and albino backgrounds were grossly 
amelanotic and largely on the dorsal trunk (within the tamoxifen- 
treated areas). Occasionally, a tumour would develop on the ventral 
trunk, tail or paw, which may reflect a predictable spread of tamoxifen 
secondary to grooming. (Fig. 2a-c). The tumours, which were prim- 
arily dermal, were generally amelanotic on the red and albino back- 
grounds, whereas the melanomas in black mice often had superficial 
pigmentation adjacent to the epidermis (Fig. 2d-f). Regardless of 
pigmentation background, the tumours were histologically similar 
with spindle cell features which were not easily distinguishable from 
tumours on C57BL/6 Braf “-Pten"*"* animals generated in parallel 
(compare Fig. 2 with Supplementary Fig. 1). On closer examination, occa- 
sional red-BRAFY®*” tumour cells were found to contain melanin 
(Fig. 2g, h). It was further possible to increase pigmentation in the 
most superficial melanoma cells with topical application of forskolin, 
an adenylate cyclase agonist known to stimulate skin pigmentation'® 
(Fig. 2i). The limited induction of pigmentation is likely related to 
the poor tissue penetration of forskolin, but nonetheless demonstrates 
the ability of the melanoma cells to become hyper-pigmented in vivo 
upon activation of cAMP signalling. 

Tumours on all three pigmentation backgrounds stained positively 
for S100, a standard immunohistochemical melanoma marker 
(Fig. 2j). In addition, reverse transcriptase polymerase chain reaction 
(RT-PCR) showed that the tumours consistently express the melano- 
cytic pigment genes M-Mitf, Dct, Tyrp1 and Tyr (Fig. 3a, d and data not 
shown). In addition, occasional HMB45* cells could be found by 
immunofluorescence (Supplementary Fig. 2). The tumours on all three 
pigmentation backgrounds were locally invasive to fat and skeletal 
muscle with active mitoses. Although no gross visceral organ meta- 
stases were observed, small clusters of cells expressing gp100, the pre- 
melanosome-associated glycoprotein (Pmel/gp100/HMB45), could be 
found in skin draining lymph nodes (Fig. 2k). 

Using a primary cell line derived from one of the red mouse mela- 
nomas, we observed that forskolin upregulated the expression of the 
melanocyte-specific isoform of Mitf (M-Mitf), and produced a marked 
increase in expression of the Dct and Tyrp1 pigment genes, consistent 
with the ability of the cells to respond to melanocytic differentiation 
signals (Fig. 3a). 

To determine whether the melanoma cells were dependent on 
the presumed oncogenic driver BRAFY°”, we tested their response 
to small molecule inhibitors of BRAF or MEK (also known as 
MAP2K). Treatment with the oncogenic BRAF inhibitor, PLX4720, 
or the MEK inhibitor, U0126, prevented melanoma cell proliferation 
in vitro, and PLX4720 blocked tumour cell growth in vivo, consistent 
with a dependency of these tumours on the BRAFY°””* oncoprotein 
(Fig. 3b, c). BRAF inhibition also elevated the expression of melano- 
cytic genes as previously reported in human melanomas (Fig. 3d)”. 
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Figure 2 | Melanomas on all three pigmentation variants are 
morphologically similar and exhibit common histologic features. 

a-c, Melanomas on black (a), albino (b) and red (c) mice are grossly 
amelanotic. d-f, Histologically, black (d), albino (e) and red (f) melanomas are 
also mostly amelanotic, although superficial tumour cells in black-BRAFY °°" 
tumours carry melanin (original magnification, X10). g, Red-BRAFY °°" 
melanomas can also carry pigment (arrows) (original magnification, x20). 

h, Further magnification of two red melanomas also illustrates pigmented 
tumour cells (original magnification, < 100). i, Forskolin induces epidermal 
pigmentation (arrowheads) and mild tumour cell pigmentation (arrows) 
(original magnification, X20). j, Tumour cells stain positive for $100 (original 
magnification, X10). k, Skin-draining lymph nodes carry gp100* cells (red) 
(scale bar, 10 um). 


Because inactivating mutations in Mc1r alter cAMP levels in the cell, 
red mice undoubtedly have numerous intracellular pathway differ- 
ences relative to wild-type McIr’”” (black) animals, including altered 
DNA repair’. We therefore wished to study whether the pheomelanin 
pigment pathway itself has an intrinsic mechanistic role, or whether it 
is merely a marker of melanoma risk. To investigate this question we 
introduced the albino tyrosinase (Tyr) allele into the red McIr““ 
background to test melanoma incidence in albino-Mclr” animals, 
which retain low MCIR activity and also lack all pigment production 
(Fig. 4a). A melanocyte-targeted LacZ transgene was used to confirm 
that the albino allele does not alter melanocyte number in these mice 
(Supplementary Fig. 3a, b)’®. As shown in Fig. 4b, the albino allele 
profoundly protected red mice from melanoma. The rare albino- 
Mcir** melanomas occurred after long latency and had the same 
amelanotic, $100, histologic features as the other pigmentation variant 
Braf animals (Supplementary Fig. 4a-c). This observation suggests 
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Figure 3 | Tumour cells from a red-Braf“ animal behave like classic 


BRAFY°” melanomas after cAMP upregulation or BRAF inhibition. 

a, Forskolin (20 uM) upregulates expression of melanocytic markers (n = 4). 
b, MAPK inhibition by PLX4720 or U0126 decreases melanoma cell 
proliferation (n = 3). c, PLX4720 blocks melanoma growth in vivo (n = 3). 

d, PLX4720 (2 1M) upregulates expression of melanocytic markers. Relative 
mRNA expression normalized to 18S ribosomal RNA and 0h time point. Error 
bars denote s.e.m. 


that the pheomelanin synthesis pathway is necessary for the high rate of 

ultraviolet-radiation-independent melanoma in the red mice. 
Previous studies have demonstrated that ultraviolet radiation 

amplifies reactive oxygen species (ROS) production and subsequent 
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oxidative DNA damage in the skin of pigmented mice’’. Cells with 
high pheomelanin levels that receive ultraviolet radiation have been 
found to carry particularly high levels of oxidative damage**. Because 
darkly pigmented individuals carry both pheomelanin and eumelanin, 
it has been proposed that their lower melanoma risk may result from 
eumelanin intermediates and polymers absorbing ROS and func- 
tioning as in vivo antioxidants””. 

To determine whether ROS-mediated oxidative DNA damage is 
affected by the pheomelanin synthesis pathway, levels of 8,5’-cyclo-2'- 
deoxyadenosine (cdA) and 8,5’-cyclo-2’-deoxyguanosine (cdG) were 
measured in DNA isolated from skin of red-McIr and albino-Mc Ir” 
mice, using a previously reported liquid chromatography-tandem 
mass spectrometric method” (Fig. 4c). These two ROS-mediated 
cyclopurines are unlikely to be artificially induced during sample pre- 
paration and are quite stable*”®. Significantly, replication studies in 
Escherichia coli have shown that S-cdA and S-cdG can lead to A-to-T 
and G-to-A mutations at frequencies of 11% and 20%, respectively”. 
Comparing cyclopurine levels in the skin of various pigmentation- 
variant mice, it was found that the levels of cdA and cdG are signifi- 
cantly higher in skin from red-McIr”* mice compared to skin from 
albino-Mc1r”’ animals (Fig. 4d, e). This observation indicates that 
activation of the pheomelanin synthesis pathway results in increased 
oxidative DNA damage. Correlative evidence for increased cellular 
oxidative stress was also found in the observation that red-McIr’ 
mouse skin carries higher levels of lipid peroxides, a product of 
ROS-mediated lipid damage (Fig. 4f). 

The findings reported here indicate that in the context of oncogenic 
BRAF activation, individuals carrying red hair/fair skin MC1R poly- 
morphisms have an increased risk of melanoma, owing to both poor 
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Figure 4 | The ultraviolet-radiation-independent propensity of red Braf“ 
mice to develop melanoma is dependent on pigment production. 

a, Genotypes of mice studied. b, The albino allele protects Mc1 r°” mice from 
melanoma development (“,eq = 40, Malbino = 48, Malbino-red = 90) Patbino- 
albino-red — 0.308, Piibine tea < 0.0001, Predaipino-red < 0.0001. Cc, ROS react with 
purine nucleosides to produce 8,5’-cyclopurine lesions (cdA shown). 


ele 


d, Selected-ion chromatograms for DNA from albino-Mcir“ mouse skin. The 
insets show the positive-ion MS/MS/MS spectra for unlabelled and labelled 
S-cdA. e, Both diastereomers of cdA and cdG are significantly higher in red- 
Mcir” skin (n = 3). *P < 0.05; **P < 0.01; ***P < 0.001. f, Lipid peroxide 
levels are significantly higher in red-McIr”” skin (n = 3), ***P < 0.0001. 
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protection from environmental carcinogens like ultraviolet radiation, 
and also via intrinsic carcinogenic features of pheomelanin synthesis; 
potentially via pheomelanin itself, an intermediate of pigment syn- 
thesis or a by-product of the pathway. 

In humans, there are multiple MC1R polymorphisms with varied 
perturbation of receptor function that produce a red hair/fair skin 
phenotype; however, a unifying feature of these various polymorphisms 
is a high pheomelanin-to-eumelanin ratio, which is also produced by 
the Mcir*° allele in mice. Recent data intriguingly demonstrated that 
black animals overexpressing hepatocyte growth factor (HGF) who 
receive ultraviolet radiation are at a higher risk of melanoma than their 
albino counterparts*'. Although the present study reveals a small dif- 
ference between black and albino BRAFY°*"-driven melanomas in the 
presence of K14-SCF (Fig. 1d), the effect did not reach statistical sig- 
nificance (P = 0.103), perhaps signifying oncogene-specific differences. 
It seems that the effects of pigmentation and ultraviolet radiation are 
likely to work together in determining melanoma risk. 

The photometer used for our laboratory’s routine calibration (Inter- 
national Light 1400) was unable to detect any measurable ultraviolet 
radiation in our mouse cages during ambient light exposure. However, 
strong epidemiological work links ultraviolet radiation to melanoma, 
and the current data do not diminish the importance of sun exposure 
as a key contributing factor’. In humans, it is likely that the ultraviolet- 
independent effects act in concert with ultraviolet-mediated cellular 
toxicity. In agreement with published studies, ultraviolet radiation at a 
UV-A/UV-B ratio similar to that found in sunlight (10J cm? UV-A 
and 0.65Jcm * UV-B) was found to exacerbate oxidative damage selec- 
tively in red mouse skin as measured by levels of lipid peroxidation*” 
(Supplementary Fig. 5a). Studies are underway to investigate whether 
ultraviolet radiation is able to alter the red-BRAFY°* tumour phe- 
notype. Preliminary studies examining the effect of visible light 
(180 J cm”) did not reveal significantly altered lipid peroxidation in 
any pigmentation context (P = 0.4506). Perhaps, however, there is a 
trend towards an increased level of lipid peroxidation in red mouse 
skin (Supplementary Fig. 5b). 

Further evidence suggesting an ultraviolet-radiation-independent 
red hair/fair skin melanoma risk is the observation that although darker- 
skinned individuals have a significantly lower risk of melanoma than 
lighter-skinned individuals, the sun protective factor (SPF, a measure- 
ment of sunburn protection) of darker skin has been estimated at only in 
the range of SPF 2.0-4.0 (ref. 28). In addition, sunscreen (typically SPF 
20-40) has shown weak efficacy in protecting against melanoma, unlike 
its protection against cutaneous squamous cell carcinoma’. There are 
numerous potential explanations for the sunscreen-melanoma data 
including the possibility that ultraviolet radiation shielding may protect 
against only one of several carcinogenic mechanisms—with the intrinsic 
pheomelanin pathway representing an additional contributor to mela- 
nomagenesis via ultraviolet-radiation-independent means. These data 
are not evidence against a role for ultraviolet radiation in melanoma- 
genesis. Indeed, the effect of ultraviolet radiation is likely to exacerbate 
this mechanism, such that ultraviolet radiation shielding and sun- 
screen remain extremely important for skin cancer prevention. 
However, further preventative strategies may be essential to optimally 
diminish melanoma risk in the most susceptible individuals. 


METHODS SUMMARY 

Mice. At 6-10 weeks of age, mice were treated topically with 20 mgml ' tamox- 
ifen for 5 days. For topical darkening, 20% coleus extract was applied as previously 
described’’. For in vivo PLX4720 studies, animals were given ad libitum mouse 
chow containing 2% PLX4720 by weight. All studies and procedures involving 
animal subjects were approved by the Institutional Animal Care and Use 
Committees of Massachusetts General Hospital and Dana-Farber Harvard 
Cancer Center, and were conducted strictly in accordance with the approved 
animal handling protocol. 

Morphological examination. Histology and immunostaining were performed 
according to established protocols using anti-S100 (Dako), anti-DCT (Santa 
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Cruz), anti-C5 MITF (tissue culture supernatant), anti-HMB45 (Santa Cruz) 
and anti-gp100 (Abcam) antibodies. 

Primary cell culture. Primary tumour was digested overnight and grown in 
DMEM. Proliferation was measured by the CellTiter-Glo Luminescent Cell 
Viability Assay (Promega). 

Quantitative RT-PCR. Messenger RNA expression of melanocytic markers was 
determined using intron-spanning mouse-specific primers with SYBR FAST 
qPCR master mix (Kapa Biosystems). 

Measurement of 8,5’-cyclopurine-2'-deoxynucleosides. cdA and cdG levels 
were measured as previously published”. Briefly, nuclear DNA was isolated from 
mouse skin, digested to nucleosides, separated by high-performance liquid chro- 
matography (HPLC) and analysed by liquid chromatography-multi-stage mass 
spectrometry (LC-MS/MS/MS). 

Measurement of lipid peroxidation. Mouse skin was irradiated with ultra- 
violet (10Jcm™* UV-A and 0.65Jcm * UV-B), or visible light (180Jcm™”). 
After homogenization, lipid peroxidation was measured with the OxiSelect 
TBARS Assay Kit (Cell Biolabs). 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Mice. All animals used for breeding were backcrossed a minimum of six genera- 
tions onto the C57BL/6 genetic background (this corresponds to a> 98.4% 
C57BL/6 congenic animal, http://jaxmice.jax.org/support/nomenclature/tutor- 
ial.html). The black (wild type), red (Mcir”) and albino (Tyr °) animals were 
purchased from Jackson Laboratories. K14-SCF animals were acquired from 
T. Kunisada. Genotyping of each litter, including the Tyr-Cre(ER)", Braf“* 
and PTEN"“""* alleles was performed as previously published'™'. At 6-10 weeks 
of age the dorsal fur was trimmed using animal shears with a 0.25 mm head and the 
mice were treated topically with 20 mg ml _' tamoxifen for 5 consecutive days. For 
tumour darkening, a 20% solution of Coleus forskohlii root extract (80 UM for- 
skolin) was topically applied daily as previously described’’. For in vivo PLX4720 
studies, animals were given ad libitum mouse chow containing 2% PLX4720 by 
weight or control chow acquired from Plexxikon. All studies and procedures 
involving animal subjects were approved by the Institutional Animal Care and 
Use Committees of Massachusetts General Hospital and Dana-Farber Harvard 
Cancer Center and were conducted strictly in accordance with the approved 
animal handling protocol. 

Dissection and histology. Tissues of interest were photographed, excised, 
weighed, rinsed in PBS, fixed in 10% neutral-buffered formalin, rinsed in PBS, 
and stored in 70% ethanol. Formalin-fixed tissues were paraffin embedded (FFPE) 
and sectioned (3-5 jim) using standard procedures. Morphological analysis was 
performed using multiple independent samples per site/organ (5 to 9 samples per 
genotype) as well as >6 animals. Two pathologists (J.K.L., M.P.H.) independently 
examined the histopathology of the tumour samples. Digitization and image 
capture was performed using an Olympus DP70 digital camera (Olympus) con- 
nected to an Olympus BX51 light microscope or a Scanscope whole-slide scanning 
system (Aperio). 

Immunohistochemistry. For immunohistochemistry, sections were deparaffinized 
with xylene and hydrated with a graded series of alcohol. Sections were boiled in 
50mM Tris-buffer (pH 9) or citrate for antigen retrieval and rinsed in PBS. 
Sections were blocked in 1% BSA, 0.1% Triton X-100 PBS, incubated with 1:200 
dilutions of rabbit anti-S100 (Dako), 1:100 dilutions of goat anti-DCT (Santa 
Cruz), 1:200 dilutions of mouse anti-HMB45 (Santa Cruz) and 1:200 dilutions 
of mouse anti-gp100 (Abcam) antibodies, followed by visualization with appro- 
priate secondary antibodies conjugated to Alexa594 or Alexa488 (1:500). 
Appropriate controls for specificity of staining were included and images were 
captured using an upright fluorescence microscope (Eclipse 90i, Nikon). To 
identify epidermal melanocytes, skin from reporter mice carrying the various 
pigmentation alleles and the K14-SCF transgene as well as a DCT-LacZ reporter 
allele was cryosectioned and stained with 5-bromo-4-chloro-3-indolyl-f-p-galac- 
toside (X-gal) and nuclear fast red counterstaining. 

Primary cell culture. Tumour cells were digested overnight in 10 mg ml! col- 
lagenase and 1 mg ml" hyaluronidase. Initially tumour cells were grown in RPMI 
media with HEPES and 20% serum. Subsequently tumour cells were grown in 
DMEM media with 10% serum. Proliferation after 72 h of PLX4720 (Chemietek) 
and U0126 (Cell Signaling) was determined by the CellTiter-Glo Luminescent Cell 
Viability Assay (Promega). 

Quantitative RT-PCR. RNA was collected from primary cultured tumour cells 
treated for varying times with forskolin or PLX4720 using the RNeasy Plus mini kit 
(Qiagen). mRNA expression of melanocytic markers was determined using 
intron-spanning mouse-specific primers with Kapa SYBR FAST qPCR master 
mix (Kapa Biosystems). Expression was normalized to 18S rRNA and 0h time 
points. Primer sequences used: Mitf forward GCCTGAAACCTTGCTATGC 
TGGAA, Mitf reverse AAGGTACTGCTTTACCTGGTGCCT, Dct forward 
AGGTACCATCTGTTGTGGCTGGAA, Dect reverse AGTTCCGACTAATCA 
GCGTTGGGT, Tyrp1 forward TGGGGATGTGGATTTCTCTC, Tyrp1 reverse 
AGGGAGAAAGAAGGCTCCTG, 18S forward AGGTTCTGGCCAACGG 
TCTAG, 18S reverse CCCTCTATGGGCAATTTT. 

Extraction of nuclear DNA from mouse skin tissues. Nuclear DNA was isolated 
from mouse skin using a high-salt method. Tissues were ground under liquid 
nitrogen into fine powder using a mortar and pestle. A nuclei lysis buffer con- 
taining 20 mM Tris (pH 8.3), 20 mM EDTA, 400mM NaCl, 1% SDS (w/v) and 
0.05% proteinase K (w/v) was added to the tissue and incubated in a water bath 
at 55°C overnight. Half volume of saturated NaCl solution was added to the 
digestion mixture, incubated at 55°C for 15 min then centrifuged at ~12,000g 


for 30 min. The supernatant was collected and centrifuged again. The nucleic acids 
in the supernatant were precipitated with cold ethanol, dissolved in water and 
incubated in the presence of 0.03% RNase A (w/v) and 0.25 U ul” of RNase T1 at 
37 °C overnight, and subsequently extracted with an equal volume of chloroform/ 
isoamyl alcohol (24:1, v/v) twice. The DNA was then precipitated from the 
aqueous layer by cold ethanol, centrifuged at 10,000g at 4°C for 15 min, washed 
twice with 70% cold ethanol and dried under vacuum. The DNA pellet was 
dissolved in deionized water and quantified by using ultraviolet absorption 
spectrophotometry. 

Enzymatic digestion of nuclear DNA. Nuclease P1 (16 U), phosphodiesterase 2 
(0.025 U), 20nmol of erythro-9-(2-hydroxy-3-nonyl) adenine (EHNA) and a 
30 ul solution containing 300 mM sodium acetate (pH 5.6) and 10 mM zinc chlo- 
ride were added to 200 pig of DNA. In this context, EHNA served as an inhibitor 
for deamination of 2'-deoxyadenosine to 2'-deoxyinosine (dI) induced by adenine 
deaminase. The above digestion was continued at 37 °C for 48 h. To the digestion 
mixture were then added alkaline phosphatase (10U), phosphodiesterase 1 
(0.0125 U) and 60 pl of 0.5 M Tris-HCl buffer (pH 8.9). The digestion was con- 
tinued at 37 °C for 2 h and subsequently neutralized by addition of formic acid. To 
the mixture were then added uniformly 15N labelled standard lesions, which 
included 400 fmol of R-cdG, 150 fmol of S-cdG, 80 fmol of R-cdA and 40 fmol 
of S-cdA. The enzymes in the digestion mixture were subsequently removed by 
chloroform extraction twice. The resulting aqueous layer was subjected to off-line 
high-performance liquid chromatography (HPLC) separation for the enrichment 
of the lesions under study, following our previously described procedures”*. 
Liquid chromatography-multi-stage mass spectrometry (LC-MS/MS/MS) 
analysis. The LC-MS/MS/MS experiments were conducted using an LTQ linear 
ion trap mass spectrometer using our recently described conditions”. Briefly, the 
amounts of cdA and cdG lesions in each nucleoside sample were calculated based 
on the ratios of peak areas found in the selected-ion chromatograms for the analyte 
(for example, the 23.5 min peak in the top panel of Fig. 4d for S-cdA) and the 
corresponding stable isotope-labelled standard (for example, the 23.4 min peak in 
the bottom panel of Fig. 4d for the '*N-labelled S-cdA), the known amount of 
uniformly '°N-labelled standard was added to the nucleoside mixture (for 
example, 40 fmol for S-cdA), and calibration curves. The calibration curves were 
constructed from the same LC-MS/MS/MS analyses of a series of mixtures with 
known compositions of the unlabelled cdA, cdG and constant amounts of 
the corresponding uniformly '°N-labelled standards, as described previously”. 
The lesion formation frequencies as shown in Fig. 4e were then calculated by 
dividing the amounts of cdA and cdG in the sample with the total amount of 
nucleosides present. 

Skin irradiation and lipid peroxide measurement. Six-week-old mice were 
euthanized and fur was removed using animal shears with a 0.25mm head. 
Twelve sections of skin, each with an area of 1cm*, were removed from each 
mouse and placed in 35-mm dishes on ice after adherence to Whatman filter 
paper suspended in PBS. For each ultraviolet and visible light study, six sections 
of skin from each mouse were placed in the dark on ice as controls. For ultraviolet 
studies, six sections of skin from each mouse were irradiated on ice with 10J cm” 
UV-A and 0.65Jcm * UV-B at an irradiance of 6.67 mW cm 7 using a Sylvania 
350 Blacklight (Osram Sylvania). This ultraviolet distribution is comparable to 
natural sunlight (96.65% UV-A and 3.35% UV-B). Two mice of each pigmentation 
type were used for a total of n = 12 skin samples for each condition. For visible 
light studies, six sections of skin from each mouse were irradiated on ice with 
180Jcm * visible light from a Dolan-Jenner A3200 Fiber-Lite Illuminator at 
an irradiance of 200mWcm *. The illuminator bulb was fit with a Thorlabs 
FEL0400 Edgepass ultraviolet filter with a transmission of <0.001% for wave- 
lengths <400 nm, such that no irradiation output was detectable in the ultraviolet 
range below 400nm. One mouse of each pigmentation type was used for a 
total of six skin samples for each condition. Following treatment, skin sections 
were flash-frozen and homogenized in PBS containing the antioxidant butylated 
hydroxytoluene (BHT) to prevent further lipid peroxidation, using a Qiagen 
TissueLyser I]. Homogenized samples were centrifuged and supernatants were 
collected. Protein content of each sample was determined by Coomassie Plus 
Protein Assay, and samples were diluted with PBS plus BHT for normalization 
of sample concentration (Thermo Scientific). Lipid peroxidation of each irradiated 
set of sample was determined using an OxiSelect TBARS Assay kit and normalized 
to its unirradiated control (Cell Biolabs). 
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Long non-coding antisense RNA controls Uchll 
translation through an embedded SINEB2 repeat 


Claudia Carrieri!*, Laura Cimatti!*, Marta Biagioli’?, Anne Beugnet®, Silvia Zucchelli’*, Stefania Fedele', Elisa Pesce’, 
Isidre Ferrer*, Licio Collavin®’®, Claudio Santoro’, Alistair R. R. Forrest®, Piero Carninci®, Stefano Biffo*”’, Elia Stupka’® 


& Stefano Gustincich!? 


Most of the mammalian genome is transcribed’*. This generates a 
vast repertoire of transcripts that includes protein-coding messenger 
RNAs, long non-coding RNAs (IncRNAs) and repetitive sequences, 
such as SINEs (short interspersed nuclear elements). A large per- 
centage of ncRNAs are nuclear-enriched with unknown function‘. 
Antisense IncRNAs may form sense-antisense pairs by pairing with 
a protein-coding gene on the opposite strand to regulate epigenetic 
silencing, transcription and mRNA stability’. Here we identify a 
nuclear-enriched IncRNA antisense to mouse ubiquitin carboxy- 
terminal hydrolase L1 (Uchl1), a gene involved in brain function 
and neurodegenerative diseases''. Antisense UchI1 increases UCHL1 
protein synthesis at a post-transcriptional level, hereby identifying a 
new functional class of IncRNAs. Antisense Uchl1 activity depends 
on the presence of a 5’ overlapping sequence and an embedded 
inverted SINEB2 element. These features are shared by other natural 
antisense transcripts and can confer regulatory activity to an arti- 
ficial antisense to green fluorescent protein. Antisense Uchl1 func- 
tion is under the control of stress signalling pathways, as mTORC1 
inhibition by rapamycin causes an increase in UCHLI protein that is 
associated to the shuttling of antisense Uchl1 RNA from the nucleus 
to the cytoplasm. Antisense Uchl1 RNA is then required for the 
association of the overlapping sense protein-coding mRNA to active 
polysomes for translation. These data reveal another layer of gene 
expression control at the post-transcriptional level. 

To discover non-coding antisense transcripts of sense—antisense 
(S-AS) pairs expressed in the brain, the mouse syntenic loci of genes 
involved in neurodegenerative diseases were identified computationally 
and examined in the Ensembl browser (http://www.ensembl.org). The 
FANTOM2 clone 6430596G22 was classified as a spliced antisense 
IncRNA of the Uchl1 gene”’; we refer to this as antisense Uchl1. UCHL1 
is a neuron-restricted protein that acts as a deubiquitinating enzyme, 
ubiquitin ligase or monoubiquitin stabilizer’. An in-frame deletion in 
the Uchl1 gene, as in gracile axonal dystrophy mice, leads to ataxia and 
axonal degeneration. Although an association of UCHL1 gene muta- 
tions to familial Parkinson’s disease has not been confirmed in inde- 
pendent families, oxidative inactivation of UCHLI protein has been 
reported in Parkinson’s disease and Alzheimer’s disease brains’. 

Antisense Uchl1 is a 5' head-to-head transcript that initiates within 
the second intron of Uchl1 and overlaps the first 73 nucleotides of the 
sense mRNA including the AUG codon. By 5’ rapid amplification of 
cDNA ends (RACE), the transcriptional start site (TSS) of antisense 
Uchl1 was mapped to the second intron of Uchll1 (Fig. 1a). The non- 
overlapping part of the transcript contains two embedded repetitive 
sequences, SINEB1 of the F1 subclass (Alu) and SINEB2 of the B3 
subclass, identified by Repeatmasker’*'*. The FANTOM2 clone spans 


a genomic region of 70 kilobases (kb) (Fig. 1a) and its genomic organi- 
zation is conserved in mammals (Supplementary Fig. 1a). 

Sense and antisense Uchl1 expression in mouse and human tissues 
was similar (Fig. 1b and Supplementary Fig. 1b). In the mouse, antisense 
Uchl1 RNA was highly expressed in the ventral midbrain (Fig. 1b) and 
in the MN9D dopaminergic cell line (data not shown). Mature Uchl1 
mRNA was observed mainly in the cytoplasm of dopaminergic neu- 
rons, whereas antisense Uchl1 was enriched in the nucleus (Fig. 1c and 
Supplementary Fig. 2). Antisense Uchl1 expression was confirmed by 
qRT-PCR from dopaminergic neurons purified with laser capture 
microdissection (LCM, Supplementary Fig. 3). 


Uchl1 ——> 


a 
Alu SINEB2 I-11 
tt i 


46 kb 


<< AS Uchi1 
RACE 


Chr 5: 67017694-67078344 


Uchi1 


AS UchI1 
c TH UchlI1 AS UchI1 Overlay 


Figure 1 | Expression of antisense UchI1 in dopaminergic neurons. a, Uchl1/ 
antisense (AS) Uchl1 genomic organization. Uchl1 exons are in black; 3’ and 5’ 
UTRs are in white; antisense Uchl1 exons are grey; repetitive elements are in red 
(Alu) and blue (SINEB2). Introns are indicated as lines. b, Quantitative 
expression of Uchl1 and antisense Uchl1 in mouse tissues (AACt/AACtmax). 
CER, cerebellum; CTX, cortex; OB, olfactory bulb; STR, striatum; VM, ventral 
midbrain. c, Antisense Uchl1 (red) and Uchll (green) transcripts are expressed 
in the nucleus and cytoplasm of TH-positive dopaminergic neurons of the 
substantia nigra (blue). 
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Transient expression of antisense Uchl1 in MN9D cells caused no 
significant change in endogenous Uchl1 mRNA expression. Notably, a 
strong and reproducible upregulation of UCHL1 protein was detected 
within 24 h (Fig. 2a). When increasing amounts of antisense Uchl1 were 
co-transfected with murine Uchl1 into HEK cells, which do not express 
either transcript, dose-dependent UCHLI protein upregulation was 
recorded in the absence of any significant change in the quantity of 
exogenous Uchl1 mRNA (Fig. 2b). These data indicate that antisense 
Uchl1 regulates UCHL1 expression at a post-transcriptional level. 

Antisense Uchl1 deletion constructs lacking the 5’ first exon (antisense 
Uchl1(A5')) or the last three exons (antisense Uchl1(A3')) failed to 
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Figure 2 | Antisense UchI1 regulates UCHLI protein levels via an embedded 
inverted SINEB2 element. a, Antisense Uchl1-transfected dopaminergic 
MN9D cells show increased levels of endogenous UCHL1 protein, with 
unchanged mRNA quantity. b, Increasing doses of transfected antisense Uchl1 
titrate UCHLI protein but not mRNA levels in HEK cells. Data in a and 

b indicate mean + s.d., n = 3.¢, Full-length (FL) antisense Uchl1 is required for 
regulating endogenous (MN9D cells, left panel) and overexpressed (HEK cells, 
right panel) UCHLI protein levels. Scheme of A5’ or A3’ deletion mutants is 
shown. d, Inverted SINEB2 is sufficient to control endogenous UCHLI protein 
levels in MN9D cells. Scheme of mutants is shown in 5’ to 3’ orientation. AA, 
AAlu; AS, ASINEB2; AAS, AAlu+SINEB2; Sf, SINEB2 flipped; ASf, 
Alu+SINEB2 flipped. e, A 73-bp overlap (OL) of antisense UchI1 is sufficient to 
increase UCHLI in transfected HEK cells. Scheme of mutant and scramble 
control in 5’ to 3’ orientation. Units for numbers along the left of gels in 

a-e indicate kDa. 
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induce UCHLI protein in MN9D and HEK cells, suggesting that both 
5’ and 3’ components are important to antisense Uchl1 function 
(Fig. 2c and Supplementary Fig. 4a). Targeted deletion of the region 
containing the embedded SINEB2 and Alu repetitive sequences (AAS) 
was also able to prevent UCHLI protein induction. Deletion of each 
repetitive element separately revealed that SINEB2 is the functional 
unit required by antisense Uchl1 for increasing UCHLI protein syn- 
thesis (Fig. 2d). In all cases no change in Uchl1 mRNA level was detected 
(Supplementary Fig. 4b). A mutant with a flipped SINEB2 sequence was 
unable to increase UCHLI protein levels, thus proving the orientation- 
dependent activity of the SINEB2 domain embedded within antisense 
Uchl1 (Fig. 2d). Importantly, an artificial construct containing the 73- 
nucleotide overlapping sequence immediately close to the repetitive 
elements in antisense Uchl1(A5') increased UCHL]I levels as much as 
the full-length clone (Fig. 2e and Supplementary Fig. 4c). 

We then considered whether other SINEB2-containing IncRNAs 
may post-transcriptionally regulate the expression of their protein- 
coding partner, on the basis of similar structural elements. The 
FANTOMS3 collection of non-coding cDNAs was bioinformatically 
screened for natural antisense transcripts that contain SINEB2 ele- 
ments of the B3 subclass in the correct orientation and 5’ head-to-head 
overlapping to a protein-coding gene. This identified 31 S-AS pairs 
similar to the UchI1/antisense Uchl1 structure (Supplementary Fig. 5). 
By sequence alignment, we chose antisense Uxt (4833404H03), antisense 
of ubiquitously expressed transcript (Uxt), as the one with the most 
similar SINEB2 elements (Fig. 3a). Transfection of antisense Uxt in 
MN9D cells elicited an increase of UXT protein level with no change in 
Uxt mRNA (Fig. 3b), indicating that antisense Uxt was similarly able to 
increase protein levels post-transcriptionally. 

These data indicate a model whereby IncRNAs regulate protein syn- 
thesis through the combined activities of two domains. The 5’ antisense 
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Figure 3 | Natural and synthetic antisense IncRNAs increase target protein 
levels. a, Scheme of Uxt/antisense Uxt genomic organization. b, Antisense Uxt 
increases endogenous UXT protein levels (left) without affecting RNA levels 
(right) in transfected MN9D cells. c, Scheme of antisense GFP construct. A5’ 
with repetitive elements (SINEB2, blue; Alu, red) and overlap (green) regions is 
indicated. d, Inverted SINEB2 plus the overlap sequence increase GFP levels in 
transfected cells. Data in b and d indicate mean = s.d., n = 3. 


15 NOVEMBER 2012 | VOL 491 | NATURE | 455 


©2012 Macmillan Publishers Limited. All rights reserved 


LETTER 


region provides specificity for the sense target gene whereas the repeti- 
tive element confers the protein synthesis activation domain. The model 
predicts that by swapping the overlapping sequence one may increase 
the amount of proteins encoded by the mRNAs of choice acting at the 
post-transcriptional level. We thus synthesized a 72-nucleotide-long 
artificial sequence antisense to the AUG-containing region as trans- 
cribed from pEGFP, and inserted it into antisense Uchl1(A5’) to gen- 
erate antisense GFP (Fig. 3c). Co-transfection of antisense GFP with 
pEGFP strongly increased GFP protein but not mRNA levels in HEK 
cells (Fig. 3d). When we pulsed cells with methionine for an hour and 
immunoprecipitated GFP, antisense GFP induced an increase in radio- 
actively labelled, neo-synthesized GFP, without affecting mRNA levels 
(Supplementary Fig. 6). 

To understand how the antisense Uchl1 transcript operates and the 
physiological conditions in which it might act, we assayed several 
stimuli and/or drugs for their ability to modulate UCHLI protein 
expression. Inhibition of mTORCI signalling favoured an increase in 
UCHLI levels in a range from 1.5- to 2.5-fold (Fig. 4a). This effect was 
evident with as low as 20nM rapamycin (Supplementary Fig. 7) and 
was concomitant with dephosphorylation of mTOR targets p70S6K 
and 4E-BP1. Furthermore, the effect was not due to a stabilization 
of the protein, as co-application of cycloheximide decreased UCHL1 
protein levels (Supplementary Fig. 8a). These data are surprising 
because rapamycin impairs formation of the CAP-dependent com- 
plex and hence translation of highly structured mRNAs”. However, 
in agreement with previous reports, in our experimental settings 
mTORC1 inhibition only slightly impairs the global rate of translation 
(Supplementary Fig. 8b). Under these conditions, it has been proposed 
that rapamycin may affect competition among different mRNAs”. If 
so, we proposed that inhibition of the CAP complex formation favours 
the translation of Uchl1 mRNA with a mechanism that requires anti- 
sense Uchll1. 

To test this model, we used two complementary approaches to estab- 
lish a loss-of-function phenotype. First, we downregulated antisense 
Uchl1 levels with short hairpin RNA (shRNA) targeting its promoter 
region. MN9D cells constitutively expressing shRNA for antisense Uchl1 
did not show any changes in UCHLI protein levels upon rapamycin 
treatment, whereas scramble control cells showed UCHLI upregulation 
as in the parental line (Fig. 4b). Dephosphorylation of p70S6K and 4E- 
BP1 proved that rapamycin inhibited mTOR activity as expected. We 
then exploited the dominant-negative property of an antisense Uchl1 
mutant lacking the SINEB2 repeat element. Overexpression of anti- 
sense Uchl1(ASINEB2) inhibited the ability of full-length antisense 
Uchl1 to increase protein levels (Supplementary Fig. 9). Upon rapa- 
mycin treatment, cells stably expressing antisense Uchl1(ASINEB2) 
did not show any UCHLI protein induction despite dephosphoryla- 
tion of mTOR targets (Fig. 4c). These complementary experiments 
prove that functional antisense expression is required for UCHL1 
protein increase elicited by rapamycin. 

Because antisense Uchl1 transcript is enriched in the nucleus of 
dopaminergic neurons, we measured antisense Uchl1 and Uchll RNA 
content in the nucleus and cytoplasm of MN9D cells upon rapamycin 
treatment. As shown in Fig. 4d, rapamycin substantially increased anti- 
sense Uchll1 concentration in the cytoplasmic fraction. This effect was 
confirmed by a concomitant decrease in its nuclear steady-state levels, 
and by the absence of any de novo transcription. The total content of 
primary and spliced transcripts remained constant (data not shown). 
Uchl1 mRNA showed no change in subcellular distribution, de novo 
transcription or total cellular content. These data demonstrate that 
antisense Uchl1 localization can be regulated by the mTOR pathway, 
and its cytoplasmic level correlates with the expression of UCHL1 
protein. 

We therefore monitored the association of Uchl1 mRNA with poly- 
somes to assess the role of translation in antisense Uchl1-mediated 
UCHLI protein induction. Fractionated MN9D cell extracts were 
used to measure the recruitment of Uchl1 mRNA on polysomes by 
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Figure 4 | Antisense Uchl1 mediates UCHLI protein induction by 
rapamycin. a, UCHLI protein level is increased in rapamycin-treated MN9D 
cells. Rapamycin inhibition of mTOR pathway is verified with anti-p-p70S6K 
and anti-p-4E-BP1 antibodies. b, Silencing antisense Uchl1 transcription 
(shRNA) in MN9D cells inhibits rapamycin-induced UCHLI protein level. 
Left, mRNA levels; right, protein levels. c, Deletion of embedded SINEB2 
(ASINEB2) is sufficient to inhibit rapamycin-induced UCHLI protein 
upregulation. d, Antisense Uchl1 translocates to the cytoplasm upon rapamycin 
treatment in MN9D cells. mRNA levels were measured with 5’ or 3’ primers. 
Data in b-d indicate mean + s.d., n= 3. **P< 0.01; ***P < 0.005. 

e, Rapamycin increases Uchl1 mRNA in heavy polysomes; the absorbance 
profile is outlined in the background of each plot. 


qRT-PCR and northern blotting (Fig. 4e and Supplementary Fig. 10). 
In basal conditions, Uchl1 mRNA was associated with translating 
ribosomes. Rapamycin treatment induced a shift of Uchl1 mRNA to 
heavier polysomes, consistent with an enhanced rate of translation 
initiation; this increase of Uchl1 mRNA association to heavier poly- 
somes did not occur in cells overexpressing the dominant-negative 
form of antisense Uchl1 (Supplementary Fig. 11). 

Antisense Uchl1 is the representative member of a new functional 
class of IncRNAs that are part of S—AS pairs in the mammalian genome 
that require overlap at the 5’ end and the action of a SINEB2 repeat. 
This new function for SINEB2 sequences in the cytoplasm adds to their 
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well-established role in the nucleus as inhibitors of RNA polymerase 
II'®. Stress-dependent nucleocytoplasmic shuttling of lncRNAs may be 
a common strategy to regulate translation, as CTN-RNA, another 
nuclear-retained IncRNA, was found to have a cryptic protein-coding 
sequence at its 3’ end when in the cytoplasm”. 

It is intriguing that this nuclear IncRNA-mediated mechanism for 
post-transcriptional control of gene expression is active when CAP- 
dependent translation is inhibited by rapamycin. This drug blocks 
mTORCI1 kinase, which activates the e[F4F complex’’”*. However, 
some mRNAs escape mTORCI inhibition by being able to be recruited 
to ribosomes in an eI[F4F-independent manner for presenting complex 
mRNA loops that function as internal ribosomal entry sites (IRES)”. 
Indeed, IRES-mediated translation is prominent in conditions of stress 
or growth factor inhibition and its alteration affects processes such as 
tumorigenesis**”*. In genetic and neurochemical models of Parkinson’s 
disease) mTORC1 inhibition protects dopaminergic neurons from 
apoptosis*>”°. 

Antisense IncRNA-mediated translation may be another mechanism 
to maintain synthesis of pro-survival proteins, such as UCHLI, that are 
involved in rapamycin neuroprotective function and more generally in 
cellular response to stress. This mechanism may represent the outcome 
of an evolutionary pressure on the genomic organization of anti-stress 
elements to favour gene-specific regulation of translation when CAP- 
dependent initiation is reduced. Finally, natural and synthetic antisense 
transcripts with embedded repetitive elements may represent molecular 
tools to increase translation of selected mRNAs, defining a potential 
new class of RNA therapeutics. 


METHODS SUMMARY 


5’ RACE for antisense Uchl1 was performed with Gene Racer (Invitrogen). 
Double-fluorescence in situ hybridization of biotin- and digoxigenin-labelled 
probes was detected using fluorochrome-conjugated reagents. Images were cap- 
tured with Confocal Laser Microscopy (LEICA). Expression of antisense Uchl1 was 
performed on neuronal cells lines and dopaminergic neurons purified with LCM 
from TH-GFP mice. Cell lines were cultured under standard conditions. shRNA 
targeting the promoter of antisense Uchl1 was cloned in pSUPERIOR.Neo.GFP 
vector (Oligo Engine). Polysomes were prepared by sucrose gradient and asso- 
ciated Uchl1 mRNA was measured by qRT-PCR and northern blotting. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Oligonucleotides. The complete list of oligonucleotides used for cloning and for 
quantitative real-time PCR experiments is included in Supplementary Information 
(Supplementary Fig. 12). 

Plasmids. Full-length DNA sequence of antisense Uchl1 was amplified via fusion 
PCR starting from RACE fragment and FANTOM clone 6430596G22 (GenBank 
AKO078321.1) with forward mouse antisense Uchl1 FL and reverse mouse antisense 
Uchl1 FL primers. 

Mouse Uchl1 mRNA was subcloned from FANTOM clone 2900059022 
(GenBank AK013729.1) in the unique Pmel site of pcDNA3.1. 

cDNA sequence of human antisense UCHL1 was amplified from a sample of 
human brain total RNA (Clontech, 636530) with the primers human 5’F and 
human 3’R. 

Oligonucleotides that target the sequence —14/+3 around the TSS of antisense 
Uchl1 were annealed and cloned into psUPERIOR.Neo.GFP vector (OligoEngine) 
in the BglII/Xhol site. Scrambled sequence was also cloned and used as control. 

The antisense Uchl1 5’ deletion mutant (A5’) was generated by PCR using the 
oligonucleotides forward mouse antisense Uchl1(A5') and reverse mouse anti- 
sense Uchl1 FL. PCR fragment was cloned in the unique EcoRI site in pcDNA3.1. 

The antisense Uchl1 3’ deletion mutant (A3’) was generated by PCR using the 
forward mouse antisense Uchl1 FL and reverse mouse antisense Uchl1(A3’) primers 
and cloned in the unique EcoRI site in pcDNA3.1. 

The antisense Uchl1(AAS) (AAlu + SINEB2) mutant was obtained by sub- 
sequent cloning of PCR fragment I (NheI-EcoRI site) and PCR fragment II 
(EcoRI-HindIII site) into pcDNA3.1. Primers forward mouse antisense Uchl1 
FL Nhe and reverse pre-SINE B2 EcoRI were used to generate fragment I; primers 
forward post-ALU EcoRI and reverse mouse antisense FL HindIII were used for 
PCR fragment II. 

The antisense Uchl1(AA) (AAlu, 1000-1045) mutant was generated with a 
similar strategy to antisense Uchl1(AAS). Forward mouse antisense Uchl1 FL 
Nhel and reverse pre-ALU EcoRI were used for PCR fragment I; forward post- 
ALU and reverse mouse antisense FL HindIII for PCR fragment II. 

The antisense Uchl1(AS) (ASINEB2, 764-934) mutant was obtained with a 
similar strategy to antisense Uchl1(AAS). Oligonucleotides forward mouse AS 
Uchl1 FL Nhel and reverse pre-SINE B2 EcoRI for fragment I; forward post- 
SINE B2 EcoRI and reverse mouse AS FL HindIII for fragment II. 

For antisense Uchl1(ASf) (Alu + SINEB2 flipped), PCR fragment obtained with 
the primers forward SINEB2 inside and reverse Alu flip was cloned in the unique 
EcoRI site of antisense Uchl1(AAS). 

For antisense Uchl1(Sf) (SINEB2 flipped), PCR fragment obtained with forward 
SINE B2 inside and reverse SINE flip oligonucleotides was cloned in the unique 
EcoRI site of antisense Uchl1 ASINEB2. 

For antisense Uchl1(73 bp), the method of ‘annealing and primer extension’ of 
two 3’-end overlapping oligonucleotides was used to generate the 73-bp antisense 
Uchl1 overlap region. Annealed fragment was obtained with antisense Uchl1 73 bp 
forward and antisense Uchl1 73 bp reverse. Fragment was digested with Xhol and 
EcoRV and ligated into antisense Uchl1 A5’ plasmid. 

The antisense SCR 73-bp mutant was obtained with a similar strategy as anti- 
sense Uchl1(73 bp). The annealing extension was performed with oligonucleotides 
with scramble (SCR) sequence (antisense SCR forward and antisense SCR reverse). 

Full-length mouse antisense Uxt was amplified by PCR starting from FANTOM 
clone 4833404H03 (GenBank AK029359.1) with specific primers (forward mouse 
antisense Uxt and reverse mouse antisense Uxt). PCR fragment was subcloned into 
pcDNA3.1 using XbaI and HindIII restriction enzymes. 

The antisense GFP plasmid was generated with a similar strategy as antisense 

Uchl1(73 bp). Seventy-two base pairs corresponding to nucleotide —40/+32 with 
respect to the ATG of GFP sequence in pEGFP-C2 vector (Clontech) were chosen 
as target sequence for artificial antisense DNA generation. For annealing, the GFP 
antisense forward and GFP antisense reverse primers were used. 
Cells. MN9D cells were obtained from M. J. Zigmond. Cells were seeded in 100-mm 
dishes in Dulbecco’s modified Eagle’s (DMEM) medium containing 10% fetal 
bovine serum (Invitrogen) supplemented with penicillin (50 units ml’) and strep- 
tomycin (50 units ml” '). For experiments, cells were plated in poly-t-lysine (P2636, 
Sigma) coated dishes and grown overnight. Approximately 50% confluent cells 
were treated with 1 uM rapamycin (R0395, Sigma) or DMSO vehicle for 45 min. 

For the establishment of stable cell lines (shRNA -15/+4, shRNA scrambled, 
pcDNA 3.1- and AS Uchl1ASINEB2), MN9D cells were seeded in 100-mm Petri 
dishes and transfected with Lipofectamine 2000 (Invitrogen) according to the 
manufacturer’s instruction. Stable clones were selected by 500 14M neomycin 
(N1142, Sigma). HEK cells (Sigma) were cultured under standard condition in 
DMEM containing 10% fetal bovine serum supplemented with antibiotics. 
Transient transfections were done with Lipofectamine 2000 (Invitrogen). For all 
experiments, S and AS plasmids were transfected at 1:6 ratio. 


Animal handling. All animal experiments were performed in accordance with 
European guidelines for animal care and following SISSA Ethical Committee 
permissions. Mice were housed and bred in SISSA/CBM non-SPF animal facility, 
with 12h dark/light cycles and controlled temperature and humidity. Mice had ad 
libitum access to food and water. C57BL/6 male mice (n = 5), 8-10 weeks old, were 
used for in situ hybridization experiments. Laser capture microdissection (LCM) 
of dopaminergic neurons was performed on 8-10-week-old male TH-GFP/21-31 
mice (n = 3). Intra-cardiac perfusions were done under total anaesthesia. 

RACE and multiplex RT-PCR. The 5’ UTR of antisense Uchl1 was amplified by 
RACE PCR (GeneRacer, Invitrogen) by MN9D total RNA and cloned into pGEM-T 
Easy vector (Promega). 

qRT-PCR. Total RNA was extracted from cells and mouse tissue samples (adrenal 
gland, cerebellum, cortex, olfactory bulb, striatum, ventral midbrain, heart, colon, 
kidney, lung, lymph node, ovary, prostate, spleen, testis, uterus) using Trizol 
reagent (Invitrogen) according to the manufacturer’s instruction. An RNA panel 
of 20 different normal human tissues (pools consist of at least three tissue donors 
with full documentation on age, sex, race, cause of death) was obtained from 
Ambion (AM6000). All RNA samples were subjected to DNase I treatment 
(Ambion). A total of 11g of RNA was subjected to retrotranscription using 
iScript cDNA synthesis kit (BioRad) and Real Time qRT-PCR was carried out 
using SYBR green fluorescence dye (2X iQ5 SYBR Green supermix, BioRad). 
TATA-binding protein (TBP) and RNA polymerase II (RPII) were used as house- 
keeping genes to normalize different mouse and human tissues as tested by the 
GeNorm program, version 3.5 (http://medgen.ugent.be/genorm/)””. GAPDH and 
B-actin were used as normalizing controls in all the other qRT-PCR experiments. 
The amplified transcripts were quantified using the comparative Ct method and 
the differences in gene expression were presented as normalized fold expression 
(AACt). All of the experiments were performed in duplicate. A heat map graphical 
representation of rescaled normalized fold expression (AACt/AACtmax) was 
obtained by using Matrix2png (http://www.bioinformatics.ubc.ca/matrix2png/). 
A list of oligonucleotides used for RT-PCR experiments is in Supplementary 
Fig. 12. 

LCM technology. For LCM, regions of midbrain from TH-GFP/21-31 mice were 
dissected and incubated in 1X Zincfix solution for 6h. They were then cryopro- 
tected in 30% sucrose solution at 4°C overnight, embedded in Neg-50 section 
medium, snap-frozen and left to equilibrate in a cryostat chamber at —21°C for 
1h before sectioning, as described earlier’®. Cryostat 14 1m midbrain coronal 
sections were thaw-mounted on Superfrost plus glass slides (Mezzle-Glasser) 
and dopaminergic GFP * cells were gathered via LCM and collected in microfuge 
(PALM adhesive caps). RNA was immediately extracted using the Absolutely 
RNA Nanoprep kit (Stratagene), eluted in RNase/DNase free water (Ambion) 
and retro-transcribed. 

Two-colour in situ hybridization. After perfusion with 4% formaldehyde, mouse 
brain was cryoprotected overnight in 30% sucrose. In situ hybridization was 
performed on cryostat slices (16 jum). Sense and antisense probes were generated 
by in vitro transcription from the cDNA encoding the distal 600 bp of mouse Uchl1 
cDNA and the last 1,000 bp of mouse antisense Uchl1. The probes for Uchl1 and 
antisense Uchl1 were labelled with digoxigenin (DIG labelling, Roche) and biotin 
(BIO-labelling mix, Roche), respectively. Incorporation of biotin and digoxigenin 
was checked via a northern blot assay. In situ hybridization was performed as 
described previously”’. Slices were pre-treated with 3% hydrogen peroxide for 
30 min. Hybridization was performed with probes at a concentration of 1 pg ml’ 
(Uchl1) and 3 pg ml ! (antisense Uchl1) at 60°C for 16h. For biotinylated RNA 
detection, streptavidin- HRP (Amersham Bioscience) was used (1:250) for 2h in 
TNB buffer (Tris HCl pH 7.5 100 mM, NaCl 150 mM, 0.5% blocking reagent), and 
signals were visualized using the TSA Cy3 system (Perkin Elmer) after washing in 
TNT buffer (Tris HCl pH 7.5 100 mM, NaCl 150 mM, 0.05% Tween-20). In situ 
hybridization on DIG-labelled probe was performed with monoclonal anti-DIG 
antibody after TSA reaction. To combine RNA in situ hybridization with immuno- 
fluorescence, slices were incubated with anti-tyrosine hydroxylase (TH) antibody 
1:1,000 (Chemicon). Signals were then detected with fluorescent dye-conjugated 
secondary antibody goat anti-rabbit 405 and goat anti-mouse 488. Sections were 
then washed, mounted with Vectashield (Vector lab) mounting medium and 
observed with a confocal microscope (Leica). 

Western blot. Cells were lysed in SDS sample buffer 2. Proteins were separated 
in 15% SDS- polyacrylamide gel and transferred to nitrocellulose membranes. 
Immunoblotting was performed with the following primary antibodies: anti- 
UCHLI] (3524 Cell Signaling), anti- UXT (11047-1-AP Proteintech Group), anti- 
p53 (1C12) monoclonal antibody (2524, Cell Signaling) and anti-B-actin (A5441, 
Sigma). For the mTOR pathway: anti-phospho-p70 S6 kinase (Thr 389) (9234), 
anti-phospho-4E-BP1 (Ser 65) (9451), anti-p70 S6 kinase (9202), anti-4E-BP1 
(9452), anti-phospho-Akt (Ser 473) (3787) were all purchased from Cell Signaling. 
Signals were revealed after incubation with recommended secondary antibodies 
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conjugated with horseradish peroxidase by using enhanced chemiluminescence for 
UCHLI (WBKLS0500 Immobilion Western Chemioluminescent HRP substrate) 
and ECL detection reagent (RPN2105, GE Healthcare). 

Protein stability. MN9D cells were seeded in 12-well plates overnight and then 
exposed to 100,1gml | protein synthesis inhibitor cycloheximide (CHX) for 
15 min and rapamycin 1 uM or DMSO vehicle control for the following 45 min. 
Cellular fractionation. Nucleo-cytoplasmic fractionation was performed using 
Nucleo-Cytoplasmic separation kit (Norgen) according to the manufacturer’s 
instruction. RNA was eluted and treated with DNase I. The purity of the cyto- 
plasmic fraction was confirmed by qRT-PCR on pre-ribosomal RNA. 
Polysome profiles. Polysome profiles were obtained using sucrose density gra- 
dients. MN9D cells were treated with 1 jg ml! rapamycin for 35 min, then with 
100 pg ml! cycloheximide for 10 min prior to lysis in 150 ul lysis buffer (50 mM 
Tris-HCl pH 7.5, 100 mM NaCl, 30 mM MgCh, 100 pg ml ~ : cycloheximide, 0.1% 
NP-40, 40 U ml! RNasin, protease inhibitors cocktail). Whole-cell extracts were 
clarified at 4 °C for 10 min at 15,000g. The equivalent of 5-10 absorbance units at 
254 nm of the clarified cell extract was layered onto 15-55% (w/v) sucrose gradient 
(50 mM Tris/acetate pH 7.5, 50 mM NH,Cl, 12mM MgCl, and 1 mM DTT) and 
centrifuged for 3 h 30 min at 39,000 r.p.m. in a Beckman SW41Ti rotor at 4 °C. The 
gradient was pumped out by upward displacement and absorbance at 254 nm was 
monitored using BioLogic LP software (Bio-Rad). One-millilitre fractions were 
collected, 1 ml Trizol reagent (Invitrogen) was added and RNA was extracted 
following the manufacturer’s instructions. A fixed volume of each RNA sample 
was then retro-transcribed and the percentage of mRNA in each fraction was 
calculated as relative Ct value to total RNA. 

Metabolic labelling. MN9D were used for analysis of translational rate. Cells were 
seeded at sub-confluency in 6-well plates, and rapamycin (11M) or DMSO sti- 
mulations were performed for 45min. Cells were labelled with 5 .Ciml~’ of 
[°°S]methionine (Amersham Pharmacia Biotech) for 45 min. Cells were lysed in 
standard lysis buffer (Tris-HCl 20 mM, NaCl 20mM, 0.5% Triton X-100) and 
centrifuged. Supernatants were trichloroacetic-acid-precipitated and filtered on 
glass fibre discs under vacuum. Discs were counted with scintillation fluid in a 
B-counter. Total proteins were measured with the standard BCA method. Rate of 
incorporation was expressed as CPM/total protein ratio (mean ~ s.d.). Experiments 
were done in triplicate. 

Pulse labelling and immunoprecipitation. To monitor de novo protein syn- 
thesis, HEK cells were transiently transfected with pEGFP plasmid (Clontech) 
in combination with antisense GFP or empty control vector. After 24h, medium 
was replaced with methionine/cysteine-free DMEM for 1h. Then, cells were 
labelled with 100 Ci ml of [*°S]methionine/cysteine (EasyTag, Perkin-Elmer) 
for 1h. Labelled cells were collected, lysed in RIPA buffer (150 mM NaCl, 50 mM 
Tris pH 8, 1 mM EDTA, 1% NP40, 0.5% deoxycholic acid and 0.1% SDS) and used 
for immunoprecipitation with anti-GFP antibody (Invitrogen) overnight. 
Immune complexes were isolated with protein G-sepharose beads (Amersham) 
and separated on 10% SDS-PAGE. Newly translated GFP was visualized by auto- 
radiography. Densitometric analysis was performed on high-resolution images 
with Photoshop-CS5. Normalization was obtained relative to input. 
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Northern blot. Polysome fractions were digested with 100 1g ml’ proteinase K 
in 1% SDS and 10 pig glycogen (Invitrogen) at 37 °C for 1 h. RNA was obtained by 
phenol/chloroform extraction and re-suspended in formaldehyde/formamide 
MOPS buffer. Samples were incubated for 5 min at 65 °C before being loaded into 
formaldehyde 1% agarose gel and run at 90 V for 4h at 4°C. RNA was transferred 
onto Amersham Hybond-XL nylon membranes and UV-crosslinked. A radio- 
labelled Uchl1-specific complementary RNA probe was transcribed from the same 
plasmid used for in situ hybridization, performing the reaction in the presence of 
50 uCi of a-**P-UTP (Perkin-Elmer). After treatment with DNase I (Ambion), the 
labelled riboprobe was purified on RNeasy columns (Qiagen). Membranes were 
pre-hybridized for 3 h at 65 °C with NorthernMax prehybridization/hybridization 
buffer (Ambion) supplemented with 50 pg ml! salmon sperm DNA (Invitrogen), 
and hybridized with UCHLI riboprobe overnight at 65 °C in the same buffer. After 
extensive washes (most stringent conditions were 0.2X SSC/0.1% SDS at 65 °C), 
membranes were exposed to autoradiography at — 80 °C with intensifying screens. 
Bioinformatic analysis. For the identification of additional translational activator 
candidates, we searched for FANTOM3 full-length cDNAs that were non-coding 
RNAs and overlap the 5’ end of coding transcripts in a head-to-head configuration. 
The filtered set of 8,535 FANTOM3 ncRNA transcripts described previously”? was 
used as our starting point. Genomic locations of these ncRNA transcripts and 
RefSeq’! coding transcripts were extracted from the alignments in the UCSC 
Genome browser* to identify a set of 788 coding-sense-non-coding-antisense 
pairs. ncRNAs were then checked by RepeatMasker to identify SINEB2-related 
sequences (http://www.repeatmasker.org). This analysis reduced the number of 
pairs to 127 protein-coding transcripts with overlap at the 5’ end (60 with a sense 
strand version of the repeat, 53 with an antisense version and 14 with both sense 
and antisense versions). 

Alignment of the SINEB2-related elements was then carried out using Clustalw 
(http://www.ebi.ac.uk/Tools/clustalw2/index.html). From this analysis the anti- 
sense overlapping transcripts with a repeat most similar to the one of antisense 
Uchll as well as in the same orientation were chosen for experimental testing 
(antisense Uxt). 

Statistical analysis. Statistical analyses were performed with paired two-tailed 
Student’s t-test. Results are mean (n = 3) + standard deviation (s.d.). 
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Cancer cells exhibit several unique metabolic phenotypes that are 
critical for cell growth and proliferation’. Specifically, they over- 
express the M2 isoform of the tightly regulated enzyme pyruvate 
kinase (PKM2), which controls glycolytic flux, and are highly depen- 
dent on de novo biosynthesis of serine and glycine”. Here we describe 
a new rheostat-like mechanistic relationship between PKM2 activity 
and serine biosynthesis. We show that serine can bind to and activate 
human PKM2, and that PKM2 activity in cells is reduced in response 
to serine deprivation. This reduction in PKM2 activity shifts cells toa 
fuel-efficient mode in which more pyruvate is diverted to the mito- 
chondria and more glucose-derived carbon is channelled into serine 
biosynthesis to support cell proliferation. 

Metabolic fluxes in cancer cells are different from those in non- 
transformed cells’. In particular, a shift from oxidative phosphorylation 
to aerobic glycolysis has been demonstrated, which is promoted by the 
M2 isoform of pyruvate kinase’. PKM2 catalyses the final step of gly- 
colysis, converting phosphoenolpyruvate (PEP) to pyruvate (Sup- 
plementary Fig. 1). Interestingly, PKM2, which is the predominant 
isoform in cancer cells*”, has low basal enzymatic activity compared 
to the constitutively active splice variant PKM1 (ref. 6). Another meta- 
bolic pathway recently demonstrated to be crucial for cancer cell sur- 
vival is the serine-biosynthesis pathway’ °. We investigated a potential 
mechanistic link between the two pathways in cancer cells, whereby a 
reduction in overall pyruvate kinase activity through the preferential 
expression of PKM2 would cause the build-up of glycolytic intermedi- 
ates for channelling into the serine-biosynthetic pathway. To test this 
hypothesis, we used human colon carcinoma HCT116 cells, which 
predominantly express the PKM2 isoform (Fig. la and Supplemen- 
tary Fig. 2). Two discrete short hairpin RNA (shRNA) pools were used 
to generate two independent HCT116-derived cell lines, shPKMa 
and shPKMb, in which the expression of both the PKM1 and PKM2 
isoforms was simultaneously and stably silenced (Fig. la and Sup- 
plementary Fig. 2c). Despite achieving a greater than 90% reduction 
in PKM1 and PKM2 (hereafter termed PKM for simplicity) messenger 
RNA and protein levels, compared to cells expressing non-targeting 
shRNA (shCntrl), no compensatory transcriptional induction of the 
PKL or PKR isoforms was observed in the shPKM cells (Supplemen- 
tary Fig. 2). In line with this, liquid chromatography—mass spectrometry 
(LC-MS) analysis of the steady-state levels of metabolites revealed a 
100-fold increase in PEP concentration in shPKM cells accompanied 
by a ~50% decrease in pyruvate levels, demonstrating a reduction in 
intracellular pyruvate kinase activity (Fig. 1b). The stable silencing of 
PKM in HCT116 cells did not alter cell proliferation rates or steady-state 
levels of ATP (Fig. 1c, d). By contrast, the proliferation rates of HT29 
and SW620 colon cancer cells were more sensitive to PKM silencing 
(Supplementary Fig. 3a). Regardless of the effect on cellular proliferation 
rates, PKM silencing universally increased the oxygen-consumption 


rates by ~30%, with a corresponding decrease in extracellular- 
acidification rates, indicators of increased oxidative phosphorylation 
and decreased glycolysis, respectively (Supplementary Fig. 3b, c). 
Because pyruvate kinase catalyses an important ATP-producing step 
in glycolysis, the stability of intracellular ATP levels could be explained 
by this compensatory increase in oxidative phosphorylation in res- 
ponse to PKM silencing. Thus, despite the predominant expression of 
PKM2 in HCT116 cells, these cells still exhibit sufficient pyruvate 
kinase activity to convert PEP to pyruvate and to facilitate aerobic 
glycolysis. 

Whereas PKM silencing caused a large increase in PEP concentra- 
tion, pyruvate levels were decreased to a lesser extent (Fig. 1b). There 
are several possible explanations for this. First, residual PKM could still 
generate pyruvate, albeit at a lower rate. Second, pyruvate can be 
synthesized from carbon sources other than glucose. Finally, pyruvate 
can also be generated from PEP through a pyruvate-kinase-independ- 
ent mechanism”, although this alternative pathway was not increased 
in the knockdown cells (Supplementary Fig. 4). 

In order to study the fate of glucose in PKM-inhibited cells, shCntrl 
and shPKM cells were incubated in media containing uniformly '*C- 
labelled glucose (U-'*C-glucose) and cells were extracted at different 
time points. Several glucose-derived metabolites were tracked by 
LC-MS (Fig. 2 and Supplementary Fig. 5), including pyruvate and 
PEP. The ratio between these two metabolites at an early time point 
after glucose labelling was validated as a reliable measure of PKM2 
activity using an activator of PKM2 (Supplementary Figs 5 and 6 and 
Supplementary Discussion). 

In the cytosol, pyruvate is metabolized to lactate by lactate dehydro- 
genase (LDH) and the resulting lactate contains three glucose-derived 
carbons. In addition, pyruvate is translocated to the mitochondria 
where it is oxidized and de-carboxylated to acetyl-CoA, which enters 
the tricarboxylic acid (TCA) cycle to form citrate, contributing two 
carbon atoms from glucose. When cells were incubated with U-'°C- 
glucose, both glucose-derived lactate and citrate were detected by 
LC-MS ('°C;-lactate and '°C)-citrate). Blocking PKM activity shifted 
the metabolism of glucose away from lactate production in the cytosol 
to citrate production in the mitochondria (Fig. 2 and Supplementary 
Fig. 5). Heavier isotopomers of citrate (particularly 13C,-citrate) were 
detected in cells incubated for a longer time period (4h) with U-8C- 
glucose, an indication of further oxidation and generation of citrate in 
the TCA cycle (Supplementary Fig. 7). These results are in line with the 
observed increase in oxygen consumption of shPKM cells (Sup- 
plementary Fig. 3). 

An increased metabolic flux into the serine and glycine biosynthetic 
pathway was also observed in cells with reduced pyruvate kinase acti- 
vity, as determined by the accumulation of glucose-derived ‘°C in 
these amino acids (Fig. 2 and Supplementary Fig. 5). This is the first 
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Figure 1 | Characterization of PKM-silenced HCT116 cells. a, PKM1 and 
PKM2 protein levels were detected by western blot. Actin was used as a loading 
control. b, Quantified intracellular metabolite concentrations and the uptake or 
secretion of extracellular metabolites in control and shPKM HCT116 cells. For 
extracellular metabolites, the dashed line indicates the initial levels in the 


direct evidence that low pyruvate kinase activity can drive serine and 
glycine biosynthesis, and demonstrates an important link between key 
metabolic processes observed in cancer, namely preferential PKM2 
expression, aerobic glycolysis and serine biosynthesis. The relative 
contribution of glucose-derived carbons to serine and glycine is low 
(Supplementary Fig. 7), which is attributable to the presence of 
unlabelled serine and glycine in the growth media of the cells. Indeed, 
when cells were incubated for 12h in serine- and glycine-free media, 
more than 80% of the intracellular serine and glycine was glucose 
derived (Supplementary Fig. 8a). Nevertheless, de novo synthesis of 
serine and glycine in cells starved of these two amino acids was insuf- 
ficient to recover their steady-state levels (Fig. 3a), indicating a rapid 
use of newly synthesized amino acids. Importantly, the absence of 
extracellular serine and glycine had a pronounced inhibitory effect 
on pyruvate kinase activity, as demonstrated by a 100% increase in 
PEP and a 30% decrease in pyruvate (Fig. 3a). Akin to PKM silencing, 
serine deprivation reduced cytosolic lactate production and increased 
mitochondrial citrate production by ~60% (Fig. 3a). Moreover, cellular 
deprivation of serine and glycine for 12 h followed by 30-min incuba- 
tion with U-'°C-glucose resulted in a 50% decrease in the labelled 
pyruvate/PEP ratio (Fig. 3b). These data indicate that serine and glycine 
deprivation decreases PKM2 activity in cells, such that more glucose- 
derived carbon is channelled into serine and glycine biosynthesis. 
PKM2 is a tightly regulated enzyme that responds not only to the 
availability of PEP and ADP substrates, but also to the upstream 


medium and the graph represents the levels after 24h incubation. c, The 
proliferation rate of the indicated cell lines. d, Intracellular ATP levels 
normalized to protein concentration in the cell extracts. Data are mean = s.e.m. 
of triplicates and are representative of three (two in b) independent 
experiments. *P < 0.05, **P < 0.01, ***P < 0.001. 


glycolytic metabolite fructose-1,6-bisphosphate (FBP) and to phos- 
phorylation events*’*"*. Previous studies have proposed that pyruvate 
kinase activity may be regulated by amino acids'*”. Therefore, the 
ability of serine or glycine to stimulate PKM2 in cells was tested. Serine 
hydroxymethyltransferase converts serine to glycine and vice versa, 
and therefore cells were starved of both amino acids overnight before 
30-min incubation with either serine or glycine. The cells incubated 
with serine or glycine showed an increase in the intracellular levels of 
the added amino acid only, and the levels of the other amino acid were 
unaffected (Supplementary Fig. 8b). When serine- and glycine-starved 
cells were incubated for 30min with serine together with U-*C- 
glucose, intracellular pyruvate kinase activity was increased relative 
to the starved cells. However, glycine did not stimulate intracellular 
pyruvate kinase activity (Supplementary Fig. 8c). 

In order to substantiate the results observed in cells, recombinant 
human PKM2 activity was further analysed in vitro. Serine was 
demonstrated to activate recombinant PKM2 with a half-maximal 
activation concentration (ACs9) of 1.3mM (Fig. 4a), a level that is 
within the physiological range of intracellular serine concentrations 
(Fig. 1b). Isothermal titration calorimetry (ITC) was used to determine 
the dissociation constant (Kj) of the PKM2-serine interaction as 
0.20 mM (Fig. 4b) and the titration curve was consistent with a 1:1 
PKM2-monomer:serine ratio. These results not only demonstrate 
direct interactions between serine and PKM2; they also suggest that 
the serine concentration required for such interactions is well within 
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Figure 2 | The effect of PKM silencing on glycolytic flux. Cells were 
incubated with U-'°C-glucose and the abundance of the main glucose-derived 
metabolite isotopomers was analysed at the indicated time points. The 
cumulative intensities of each labelled metabolite are presented in blue 
(shCntrl) or red (shPKM). The white and black circles under each metabolite 
illustrate '*C and “°C labelling, respectively. All metabolic quantifications are 
presented as mean + s.e.m. of triplicates and are representative of three 
independent experiments. 3-PG, 3-phosphoglycerate; PDH, pyruvate 
dehydrogenase. P values at 4h are shown where significant. 


the physiological range of intracellular serine levels. In agreement with 
the results observed in cells, glycine could not directly activate PKM2 
in vitro (Fig. 4a). Similarly to FBP, serine lowered the Michaelis con- 
stant (Ky) of PKM2 for PEP by 2.3-fold, effectively increasing 
the affinity of PKM2 for this substrate (Fig. 4c). Serine was found 
to be the only standard amino acid that could activate PKM2 (Sup- 
plementary Fig. 9a). A fragment-based crystallographic screen'* 
versus human PKM2 revealed the amino acids L-alanine, L-cysteine, 
L-threonine and L-serine bound to a previously uncharacterized bind- 
ing pocket on PKM2. Crystallographic soaking experiments revealed 
L-serine bound to PKM2, with a single L-serine molecule bound to each 
of the monomers comprising the PKM2 tetramer (Fig. 4d and Sup- 
plementary Tables 1 and 2). L-serine made multiple hydrogen-bonding 
interactions with PKM2 (see Supplementary Fig. 9 and Supplementary 
Discussion). Additional interactions afforded by the serine side-chain 
hydroxyl group rationalize the low affinity of glycine for PKM2 
and may contribute to the unique ability of serine to activate PKM2 
(ref. 16). The amino acid binding pocket was vacant in crystals not 
soaked with serine (Supplementary Fig. 9b). 
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Figure 3 | Serine and glycine deprivation changes glucose metabolism. 

a, HCT116 cells were incubated for 12h with U-°C-glucose in the presence 
(+SG) or absence (—SG) of serine and glycine. The abundance of the main 
glucose-derived isotopomers (black bars) and the unlabelled fraction (white 
bars) of the indicated metabolites were analysed. The circles represent the 
relevant isotopomers as in Fig. 2. b, Intracellular pyruvate kinase activity is 
represented as the ratio between glucose-derived ('*C3-) pyruvate and PEP, 
30 min after labelling with U-'*C-glucose. Results were normalized to shCntrl 
+ SG, All results are presented as mean + s.e.m. of triplicates and are 
representative of two independent experiments. *P < 0.05, **P< 0.01, 
***P < (0.001. 


On the basis of the above observations, a recombinant PKM2(H464A) 
mutant was produced and demonstrated neither direct binding to, nor 
activation by, serine (Fig. 4b, e). These data confirm that the amino- 
acid-binding site identified in the soaking experiments is the only 
amino-acid-binding site on the PKM2 protein. Despite its inability 
to bind serine, the PKM2(H464A) mutant was activated by FBP in a 
similar way to wild-type PKM2. Conversely, a S437Y mutant of PKM2, 
which cannot bind FBP", was activated by serine (Fig. 4e, f). These 
results demonstrate that PKM2 is independently activated by either 
FBP or serine, and that both molecules could contribute to PKM2 regu- 
lation in response to glucose and/or amino acid deprivation in vivo. 
PKM2 regulation by serine in tumours is important as the serine con- 
centration in the blood is ~20-fold below that of glucose, hence under 
an interrupted blood supply, a considerable deficit in serine supply 
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Figure 4 | Serine is an allosteric activator of PKM2. a, In vitro activity of 
recombinant human PKM2 was analysed in the presence of increasing 
concentrations of FBP (@), serine (O) or glycine (A). b, Serine binding to 
PKM2 wild-type (WT) or PKM2(H464A) mutant was measured by ITC. The 
Kg of serine for wild-type PKM2 was measured as 200 11M. No serine binding to 
PKM2(H464<A) was detected. c, The initial PKM2 reaction rate (Vo (relative 
luminescence units (RLU) per min)) was measured at different PEP 
concentrations in the presence of 50 4M FBP (@), 100 mM serine (O) or 
vehicle (A). K,, values for PEP were determined as 1.9 mM, 0.81 mM and 
0.19 mM in the presence of vehicle, serine and FBP, respectively. For a and 

c, results are presented as mean + s.e.m. of three experiments. d, 2.3 A2F,-F. 


would occur. Serine is crucial for multiple metabolic pathways required 
for cell growth and proliferation, including phospholipid, purine and 
glutathione biosynthesis, as well as being a methyl source for single- 
carbon metabolism. In addition, serine levels are depleted at the peri- 
phery of solid tumours, hence de novo serine biosynthesis is critical for 
tumour growth". 

The allosteric regulation of pyruvate kinase is associated with complex 
structural changes'’"*”?. Phosphorylation of Tyr 105, adjacent to Arg 106 
in the serine-binding pocket, has been shown to modulate PKM2 activity 
and FBP binding”, and oxidation of Cys 358, which is proximal to the 
serine-binding pocket, inhibits PKM2 (ref. 23) (Supplementary Fig. 9e). 
This indicates that the PKM2 serine-binding pocket, and proximal resi- 
dues, constitute a key structural regulatory node for PKM2. 

The predominant isoform of pyruvate kinase in cancer cells is 
PKM2; however, the serine-binding site identified here is conserved 
in PKM1, PKL and PKR. Therefore, the possibility that serine may be a 
universal pyruvate kinase regulator was tested in vitro. PKM1 demon- 
strated a high degree of basal activity in the absence of exogenous 
activators, and was refractory to both FBP and serine activation 
(Fig. 4g). By contrast, a recombinant protein corresponding to the 
common portion of PKL and PKR (referred to as PKLR) was com- 
pletely inactive in its basal state and was robustly activated by FBP. 
Notably, PKLR was not activated by serine (Fig. 4h). These results 
indicate that only PKM2-expressing cells can respond to changes in 
serine availability and support shuttling of glucose-derived carbon into 
serine biosynthesis after serine deprivation. 

This work provides a new understanding of the relationship between 
glucose and amino acid metabolism. Serine biosynthesis is an anabolic 
pathway required for growth and proliferation. However, it recruits 
carbon away from the energy-production pathway of glucose utilization. 
Here, to our knowledge, we present for the first time an elucidation of 
the mechanism that tightly controls the metabolic bifurcation of 
glucose-derived carbon. The control of PKM2 activity through serine 


[Compound] (M) [Compound] (M) 


map (purple) contoured at 1o for the final, refined structure of L-serine 
(orange) bound to PKM2 (green). The side chain of His 464, which was 
subsequently mutated to alanine, is shown in yellow. e, f, In vitro activity of 
PKM2 mutants H464A (e) and $437Y (f) was analysed in the presence of 
increasing concentrations of FBP (@) or serine (©). g, h, In vitro activity of 
PKM1 (g) and PKLR (h) was analysed in the presence of increasing 
concentrations of FBP (@) or serine (O). For a and e-h, the signal was 
normalized to controls containing no enzyme. The basal activity of the studied 
pyruvate kinase in each panel in the presence of vehicle control is indicated by 
the dotted line. For e-h, data are presented as the mean of duplicate 
determinations and are representative of three independent experiments. 


availability provides a rheostat-like mechanism. When serine is abund- 
ant, PKM2 is fully active, enabling the maximal use of glucose through 
glycolysis. However, when the steady-state levels of serine drop below a 
critical point, an immediate attenuation of PKM2 activity occurs. This 
enables the fast shuttling of glucose-derived carbon to serine biosyn- 
thesis, compensating for the serine shortfall and enabling growth 
and proliferation in the absence of these amino acids (Supplementary 
Fig. 1). Finally, by activating PKM2, serine supports aerobic glycolysis 
and lactate production, events that are critical for cancer cell growth and 
survival. 


METHODS SUMMARY 


shCntrl and shPKM cells were obtained by infecting cells with control non-target- 
ing shRNA or shPKM lentiviral particles, respectively. mRNA and protein levels of 
pyruvate kinase isoforms were determined by quantitative PCR and western blot. 
Cell proliferation was measured using the Alamar Blue assay. ATP levels were 
measured using a bioluminescent assay kit. Oxygen-consumption rate and extra- 
cellular-acidification rate were measured using a Seahorse XF24. The quantification 
of intracellular and extracellular metabolites was performed by LC-MS. Isotopomer 
distribution of intracellular metabolites after labelling with U-'*C-glucose 
was analysed with the PeakML.Isotope.TargettedIsotopes() function of the 
mzmatch.R library. In vitro PKM2 activity was measured by the quantification of 
ATP production using luminescent Kinase-Glo Plus reagent. ITC experiments were 
performed on a MicroCal VP-ITC isothermal titration calorimeter. His-tagged 
human PKM2 was purified, concentrated, crystallized and then soaked in the 
presence or absence of serine and X-ray diffraction data were collected. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 

Cell culture. HCT116 and HT29 colon cancer cells were maintained at 37 °C and 
5% CO, in high glucose DMEM (21969-035, Invitrogen) supplemented with 10% 
FBS and 2 mM L-glutamine. $W620 colon cancer cells were maintained at 37 °C 
and 5% CO, in RPMI (12633-020, Invitrogen) supplemented with 20% FBS and 
2mM L-glutamine. Stable PKM knockdown and control cell lines were cultured in 
the same media containing additional 2 1g ml! puromycin. 

Stable PKM silencing. HCT116, SW620 and HT29 cells were infected with len- 
tiviral particles containing control shRNA (shCntrlb) (sc-108080) or a pool of 
three PKM shRNAs (shPKMb) (sc-62820) (5’-GATCCCTGTGGCTCTAGA 
CACTAATTCAAGAGATTA-GTGTCTAGAGCCACAGTTTTT-3';_ 5’-GATC 
CGTCTGGAGAAACAGCCAAATTCA-AGAGATTTGGCTGTTTCTCCAGA 
CTTTTT-3’; 5’-GATCCGTCTGGAGAAACAGC-CAAATTCAAGAGATTTG 
GCTGTTTCTCCAGACTTTTT-3’) (Santa Cruz Biotechnology) according to 
the manufacturer’s instructions. Infected cells were selected using 6 jig ml‘ puro- 
mycin and shPKM clones were analysed for PKM1 and PKM2 expression levels 
using western blot analysis and quantitative PCR (qPCR). A different set of plas- 
mids containing PKM shRNA (shPKMa) was bought from Openbiosystems 
(TRCN0000037610) (5’-CCGG-GAAGGGAAAGAACATCAAGAT-CTCGAG- 
ATCTTGATGTTCTTTCCCTTC-TTTTTG-3’) and (TRCN0000037611) (5’- 
CCGG-CGGGTGAACTTTGCCATGAAT-CTCGAG-ATTCATGGCAAAGTTC 
ACCCG-TTTTTG-3’). Scramble shRNA (shScramble) (5'’-CCTAAGGTTAAG 
TCGCCCTCGCTCGAGCGAGGG-CGACTTAACCTTAGG-3’) (Addgene plas- 
mid no. 1864) was used as a control (shCntrla). HCT116 cells were infected with 
both pLKO.1-shPKM or pLKO.1-shScramble and selected using 2 1g ml~' puro- 
mycin for 2 weeks and PKM-silenced clones were analysed for PKM1 and PKM2 
expression levels using western blot analysis and qPCR. 

mRNA extraction and qPCR analyses. 4 X 10° cells were plated in a 6-well plate 
and were lysed after 2 days in RNeasy lysis buffer (Qiagen). Lysates were passed 
through QIAshredder columns (Qiagen) and mRNA was isolated using the 
RNAeasy kit following the manufacturer’s instructions. RNA was quantified 
and quality-controlled using an Eppendorf biophotometer and Eppendorf sin- 
gle-sealed cuvettes, UVette (Eppendorf UK Limited). For PCR analyses 1 pg 
mRNA was retro-transcribed into complementary DNA using High Capacity 
RNA-to-cDNA (AB, Life Technologies Corporation). In brief, 0.5 4M primers, 
1X Fast SYBR Green Master mix (AB, Life Technologies Corporation) and 1 plofa 
1:10 dilution of cDNA in a final volume of 20 1] were used. qPCR was performed 
on the 7500 Fast Real-Time PCR System (Life Technologies Corporation) and 
expression levels of the indicated genes were calculated using the AAC, method by 
the appropriate function of the software using actin as calibrant. The PCR program 
was: 20s at 95 °C followed by 40 cycles of 3 s at 95 °C and 30s at 60 °C. Finally the 
melting curve was performed, which was used to confirm the presence of single 
PCR products. Primers are as follows: f-actin-forward primer: 5’- TCCAT 
CATGAAGTGTGACGT-3’; B-actin-reverse primer: 5’- TACTCCTGCTT 
GCTGATCCAC-3’; PKM1-forward primer: 5’-GAGGCAGCCATGTTCCAC- 
3'; PKM1-reverse primer: 5’-TGCCAGACTCCGTCAGAACT-3’; PKM2-for- 
ward primer: 5’-CAGAGGCTGCCATCTACCAC-3'; PKM2-reverse primer: 
5'-CCAGACTTGGTGAGGACGAT-3’. PKL-forward primer: 5’-CTGGTGATT 
GTGGTGACAGG-3' PKL-reverse Primer: 5’-TGGGCTGGAGAACGTAG 
ACT-3' PKR-forward primer: 5’-CAATTTGGCATTGAAAGTGG-3’; PKR- 
reverse primer: 5’- CCTGTCACCACAATCACCAG-3’. 

Immunoblotting. 4 < 10° cells were plated in a 6-well plate and were lysed after 
2 days in radio-immunoprecipitation assay buffer (RIPA) (150 mM sodium chlo- 
ride, 1.0% NP-40, 0.5% sodium deoxycholate and 50mM Tris, pH 8.0) sup- 
plemented with a 1:100 dilution of the protein inhibitors cocktail (Sigma). Protein 
concentration was determined using the bicinchoninic acid assay (Thermoscientific) 
using BSA as standard (Thermoscientific). Equal amounts of protein were loaded 
into 12% SDS-PAGE gels and electrophoretically separated using Tris-glycine SDS 
running buffer. After SDS-PAGE, proteins were transferred onto 0.22 um nitro- 
cellulose (Millipore) and probed with antibodies, all at 1:1,000 dilution in 5% non- 
fat milk. PKM1 antibody was custom-made by PolyPeptide Laboratories using 
the following peptide sequence: CLVRASSHSTDLMEAMAMGS. The PKM2 (cat 
no. 3198) and PKM (no. 3186) antibodies were purchased from Cell Signaling 
Technology. The anti-actin antibody (mouse monoclonal AC-40) was purchased 
from Sigma. For detection, membranes were incubated with either donkey-anti 
rabbit (926 32213) or donkey-anti mouse (926 32212) secondary antibodies pur- 
chased from Licor, all at 1:1,000 dilution in TBS Tween 0.1%. The infrared scan- 
ning was performed using the Licor Odyssey scanner (channel, 800; brightness, 50; 
contrast, 50; sensitivity, auto; resolution, 169.492 Lm; pixel area, 0.02873; intensity, 
5) and acquired using Odyssey software version 3. Images were then exported as 
TIFF and cropped using Adobe Photoshop CS4. 

Cell proliferation. shPKM and control HCT116, HT29 or SW620 cells were 
seeded into a 96-well plate at a density of 1,000 cells per well in 200 ul of 
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DMEM containing 2% FBS. On days 4 to 8 after seeding, 20 ul of Alamar Blue 
solution (life technologies) was added to each well measured. Cells were incubated 
for 6h and fluorescence was measured using 535nm excitation and 590-nm 
emission wavelengths. 

ATP levels. 6 X 10° cells were seeded on a 6-well plate the day before the experi- 
ment. Cells were then washed twice with PBS in order to remove dead cells, and 
then lysed using the ATP-release buffer (Sigma). ATP was then measured using a 
luciferase-based assay according to the manufacturer’s instructions using the ATP 
bioluminescent somatic cell assay kit FLASC (Sigma). Values were normalized to 
the total protein content of the cell lysate as measured by BCA assay (Thermo- 
scientific) using BSA as standard. 

Measurement of oxygen-consumption rate and extracellular-acidification 
rate. 3 X 10* cells were plated onto XF24 plates in DMEM (10% FBS, 2mM 
glutamine) (Seahorse Bioscience) and incubated at 37°C, 5% CO, overnight. 
The medium was then replaced with 675 1l of unbuffered assay media (Seahorse 
Bioscience) supplemented with 2 mM glutamine, 25 mM glucose and 2% FBS (pH 
was adjusted to 7.4 using sodium hydroxide 0.5 mM) and cells were then placed at 
37°C in a CO)-free incubator for 30 min. Basal oxygen-consuption rate (OCR) 
and extracellular-acidification rate (ECAR) were recorded using the XF24 plate 
reader. At the end of the experiment 1 1M antimycin A was added in order to 
measure mitochondria-independent oxygen consumption. Each measurement 
cycle consisted of 3 min mixing, 3 min waiting and 4min measuring. OCR and 
ECAR were normalized to cell number. To obtain the mitochondrial-dependent 
OCR, only antimycin-A-sensitive respiration was used. Homogeneous plating and 
cell count were assessed by fixing the cells with 10% trichloroacetic acid for 1h at 
4°C and then staining the fixed cells with 0.47% solution of Sulforhodamine B 
(Sigma). 

Quantification of intra- and extracellular metabolites by the standard addi- 
tion method. 1 < 10° cells were plated onto 6-cm plates in triplicates and cultured 
in standard medium (DMEM, 10% FBS, 2 mM glutamine). Two additional plates 
were grown as counter plates. The medium was replaced after 24h by 10 ml of 
fresh standard medium, and cells were incubated for another 24h before extrac- 
tion (as described in the following section). Standard compounds were weighed 
separately and dissolved together in water to make solution A (in which each 
metabolite has a concentration between 1 mM and 10mM). 1 ml of solution A 
was added to 49 ml of dilution solvent (50:50 acetonitrile:water) to make stock 
solution B (in which each metabolite had a concentration between 20 uM and 
200 uM). For quantification, cells or media extracts (200 pil) were mixed with 
800 ul of dilution solvent, containing 0, 4, 20, 100, 300 or 500 ul of stock solution 
B. Dilutions were analysed by LC-MS. The concentration of each metabolite in the 
extract was calculated according to the linear regression fit”. All dilution series 
were performed in triplicates using three biological replicates. 

Measurement of '*C-labelled metabolites by LC-MS. 4 X 10° cells were plated 
onto 6-well plates and cultured in standard medium for 24h. The medium was 
then replaced by 2 ml of fresh medium containing 5 mM unlabelled glucose and 
3h later 5mM of U-?C-glucose (Cambridge Isotope Laboratories) was added; 
alternatively, medium was replaced by 2 ml of fresh medium with U-'°C-glucose 
only. Cells were incubated for the indicated time before extraction. For extraction, 
cells were washed twice in PBS and metabolites were extracted on a dry ice/ 
methanol bath in a 50:30:20 ratio of methanol:acetonitrile:water and quickly 
scraped. The insoluble material was spun down in a cooled centrifuge at 
16,000g for 15min at 0°C and the supernatant was collected for subsequent 
LC-MS analysis. The volume of extraction solution was calculated according to 
cell number and, extrapolated using a ‘counter dish’ cultured under the same 
experimental conditions as the sample dishes. A volume of 1 ml of extraction 
solutions per 2 X 10° cells was used. Metabolites were separated using a liquid 
chromatography system. A ZIC-pHILIC column (4.6 mm X 150 mm, guard col- 
umn 4.6 mm X 10 mm; Merck) was used for liquid chromatography separation 
using gradient elution with a solution of 20 mM ammonium carbonate, with 0.1% 
ammonium hydroxide, and acetonitrile. Detection of metabolites was performed 
using a Thermo Scientific Exactive high-resolution mass spectrometer with elec- 
trospray ionization, examining metabolites in both positive and negative ion 
modes, over the mass range of 75-1,000 m/z. 

2-dimensional gel electrophoresis. shPKM and control HCT116 cells were lysed 
precipitated and resuspended as described in ref. 9. Isoelectric focusing (IEF) was 
performed using ZOOM strips (pH 3-10, non-linear) according to the manufac- 
turer’s instructions (Invitrogen). After IEF, strips were equilibrated in buffer con- 
taining 10mgml~' DTT and subsequently in buffer containing 25mgml ! 
iodoacetamide before SDS-PAGE. As negative control, shPKM HCT116 cells 
were lysed and precipitated in acetone and the pH adjusted to pH 5 by the addition 
of 1% acetic acid. This removed any acid-labile phosphates such as phospho- 
histidine. PGAM1 and its phosphorylated forms were detected by western blot 
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using goat anti-PGAM1 (Novus) 1:1,000 and Donkey anti-Goat (Licor) 1:1,000 
antibodies. 

PKM2 activator. Cells were treated for 1 h with 20 11M of the commercially 
available PKM2 activator 1-(3-chloro-5-trifluoromethyl-pyridin-2-yl)-1H-pyrrole- 
2-sulfonic acid p-tolylamide”® (indicated in this letter as Cmpd1) or with vehicle as 
control, Both were incubated in media containing 25 mM U-'’C-glucose for the 
indicated time. 

In vitro measurement of pyruvate kinase activity. PKM2 was expressed and 
purified as described in the ‘PKM2 X-ray crystallography’ section below. Rabbit 
muscle PKM1 was obtained from Sigma; PKLR was purchased from Abcam. 
Enzyme activity was measured in vitro with a coupled assay quantifying levels 
of ATP using luminescent Kinase-Glo Plus reagent (Promega) as described prev- 
iously in ref. 27 with some modifications. Measurements were performed in the 
presence of 50 mM Tris, pH7.5, 100mM KCl, 10mM MgCl, 200 uM PEP, 
200 uM ADP, 3% DMSO and either 4nM PKM2, 4nM PKM1 or 10nM PKLR. 
Reactions (25 pl) were incubated for 20 min on a shaker before addition of 25 ll 
Kinase-Glo Plus reagent, as per the manufacturer’s instructions. Luminescence 
was read with a PHERAstar (BMG LABTECH). The signal was normalized to the 
no-enzyme controls. Activation curves were fitted to a four-parameter logistic 
equation and K,, curves were fitted to a Michaelis-Menten equation using 
Prism 5 (GraphPad). 

Isothermal titration calorimetry. Isothermal titration calorimetry experiments 
were performed on a MicroCal VP-ITC at 25°C in a buffer comprising 50 mM 
Tris, 100mM KCl, 10mM MgCl, and 1mM Tris(carboxyethyl phosphine) 
(TCEP) at pH7.5. For titrations the L-serine concentration was 5mM in the 
injection syringe and the PKM2 concentration was 281M in the sample cell. 
The protein concentration refers to the monomer. PKM2 was incubated for 
30 min with an excess of FBP (200 1M) before the L-serine titration was performed 
in the presence of FBP. The Kg value for L-serine binding was much greater than 
the PKM2 concentration used, making it difficult to accurately determine the 
stoichiometry value. Therefore, the stoichiometry parameter was fixed at 1 for 
the purpose of data analysis using the single site-binding model in Origin 7.0. 
PKM2 X-ray crystallography. A publicly available human PKM2 expression 
construct was obtained from the Structural Genomics Consortium (SGC). His,- 
PKM2 was purified using nickel-nitrilotriacetic acid affinity capture and HiLoad 
Superdex 16/60 S75 size-exclusion chromatography. Human PKM2 was crystal- 
lized using hanging-drop vapour diffusion. Protein solution (10 mgml', 25mM 
Tris/HCl, pH 7.5, 0.1 M KCL, 5mM MgCl, 10% (v/v) glycerol) was mixed ina 1:1 
ratio with reservoir solution containing 0.1 M KCl, 0.2 M ammonium tartrate, 24% 
(w/v) PEG3350. Crystals were soaked overnight in a solution containing 30 mM 
L-serine, cryo-protected and flash-frozen in liquid nitrogen. X-ray diffraction data 
were collected from a single crystal at 100K at Beamline-103 at the Diamond Light 
Source. Diffraction data were processed using XDS AutoPROC from Global 
Phasing and SCALA (CCP4)**. Molecular replacement was performed using 
model 3H60 (SGC) in CSEARCH” and maximum-likelihood refinement carried 
out using a mixture of automated (see ref. 18 and references therein) and manual 
refinement protocols employing Refmac (CCP4) and AutoBuster from Global 
Phasing. Ligand fitting was performed using Autosolve” and manual rebuilding. 
Simulated annealing was not used. The four PKM2 monomers comprising the 
tetramer in the asymmetric unit were refined as independent entities, but non- 
crystallographic symmetry (NCS) restraints were imposed in AutoBuster using the 
‘ncsauto’ command. Refinement of the structure in the absence of NCS restraints 
gave (R= 23.7, R = 17.7) and with ‘ncsauto’ gave (Ry= 22.7, R = 17.9), showing a 
small, but significant, reduction in Ry using ‘ncsauto’ restraints. At the “effective 
resolution’ of 2.36A there are ~86,000 unique reflections. The refinement 
included ~ 16,600 non-hydrogen atoms. B-factors were refined isotropically giving 
a total of ~66,500 parameters for all non-hydrogen atoms in the PKM2 tetramer. 
The four serine molecules were refined as independent ligands. Details of ligand 
occupancies, B-factors and so on are tabulated in the crystallographic data tables 
(Supplementary Tables 1 and 2)°?’. 

PKM2 mutagenesis. Human PKM2 point mutant constructs were generated 
using a Stratagene QuikChange II site directed mutagenesis kit (no. 200524). 
PCR protocols were as defined in the product manual. The following forward 
DNA primers, and their reverse complemented primer counterparts, were used 
for the mutagenesis reactions (sequence of mutated bases shown in uppercase 
bold): H464A, 5’gctcgtcaggccGCcctgtaccgtggc3’; S437Y, 5’accaagtctggcagetAtgct 
caccaggtgg3’. Primers were purchased from Sigma. The previously described SGC 
human PKM2 construct was used as the DNA template within the PCR reactions. 


The presence of the point mutations was confirmed by DNA sequencing of the 
DNA constructs (Beckman Coulter Genomics Inc.) and in-house LC-MS of the 
purified recombinant proteins. The mutant proteins were expressed and purified 
identically to the wild-type protein. 

Statistical analyses. The data (mean = s.e.m.) are representative of three inde- 
pendent experiments, performed in triplicates (unless otherwise indicated). Data 
were analysed and presented with GraphPad Prism 5.01 software (GraphPad 
Software Inc.). P values were calculated using an unpaired two-tailed t-test. 
Data processing. The data-processing workflow started by first converting the 
vendor specific raw data files from the mass spectrometer into the mzXML open- 
data format*, using the msconvert utility from the ProteoWizard Library and 
Tools collection” (http://proteowizard.sourceforge.net/). The set of all chromato- 
graphic peaks in each of the converted raw files were then extracted using the 
CentWave” feature detection algorithm from XCMS”. The resulting data were 
stored in the PeakML file format”, and the rest of the processing was handled by 
the scriptable mass spectrometry data-processing tool mzmatch.R* (http:// 
mzmatch.sourceforge.net/). 

The next step in the workflow involved aligning and combining the chromato- 
graphic features between biological replicates of a single sample. The PeakML files 
thus created were subjected to an additional filtering procedure to discard all peaks 
that were not reproducibly detected in all biological replicates involved. 
Chromatographic peaks of individual samples were then aligned together on the 
basis of their retention time and m/z values, and combined into a single PeakML 
file. Peak sets that do not include peaks from every sample were filled in by 
extracting ion chromatograms within the retention time and mass window of 
the corresponding peak set directly from the raw data files. From these peak sets, 
only those that had more peaks than the number of replicates minus one were 
selected for further analysis. Putative identification of the peak sets were made by 
matching the detected masses to that of the compounds relevant to this study. 
Isotope peaks were extracted by identifying the peaks that fell in the retention time 
window of the identified unlabelled peak and correspond to the estimated mass 
window (2 p.p.m.) of the isotope. All isotope identification and quantification of 
the ratios were performed by the PeakML.Isotope.TargettedIsotopes() function of 
the mzmatch.R library, and detailed documentation and tutorials for which are 
available at http://www.mzmatch.sourceforge.net/isotopes-targetted.php. 
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A vaccine strategy that protects against genital 
herpes by establishing local memory T cells 


Haina Shin! & Akiko Iwasaki! 


Most successful existing vaccines rely on neutralizing antibodies, 
which may not require specific anatomical localization of B cells. 
However, efficacious vaccines that rely on T cells for protection 
have been difficult to develop, as robust systemic memory T-cell 
responses do not necessarily correlate with host protection’. In 
peripheral sites, tissue-resident memory T cells provide superior 
protection compared to circulating memory T cells**. Here 
we describe a simple and non-inflammatory vaccine strategy that 
enables the establishment of a protective memory T-cell pool 
within peripheral tissue. The female genital tract, which is a portal 
of entry for sexually transmitted infections, is an immunologically 
restrictive tissue that prevents entry of activated T cells in the 
absence of inflammation or infection*. To overcome this obstacle, 
we developed a vaccine strategy that we term ‘prime and pull’ to 
establish local tissue-resident memory T cells at a site of potential 
viral exposure. This approach relies on two steps: conventional 
parenteral vaccination to elicit systemic T-cell responses (prime), 
followed by recruitment of activated T cells by means of topical 
chemokine application to the restrictive genital tract (pull), where 
such T cells establish a long-term niche and mediate protective 
immunity. In mice, prime and pull protocol reduces the spread 
of infectious herpes simplex virus 2 into the sensory neurons and 
prevents development of clinical disease. These results reveal a 
promising vaccination strategy against herpes simplex virus 2, 
and potentially against other sexually transmitted infections such 
as human immunodeficiency virus. 

Viral sexually transmitted infections (STIs) such as human immuno- 
deficiency virus 1 (HIV-1) and herpes simplex virus 2 (HSV-2) account 
for considerable morbidity and mortality around the world. Strong 
preclinical evidence for the role of T cells in controlling viral STIs 
has led to the design of prophylactic vaccines that elicit systemic cellu- 
lar immunity, and yet these vaccines have not been efficacious’®. 
Although systemic memory T cells can migrate freely through organs 
such as the spleen and liver, other parts of the body such as the in- 
testines, lung airways, central nervous system, skin and vagina are 
restrictive for memory T-cell entry®. In the latter tissues, inflammation 
or infection is often required to permit entry of circulating activated T 
cells to establish a tissue-resident memory T-cell pool that is separate 
from the circulating pool*”*. Given that side effects of inflammation in 
the reproductive tissue may preclude the use of a live prophylactic 
vaccine given vaginally, we investigated an alternative approach to 
recruit virus-specific T cells into the vaginal mucosa without inducing 
local inflammation or infection. 

After genital HSV-2 infection, chemokine (C-X-C motif) ligand 9 
(CXCL9) and CXCL10 expression is induced by interferon-y secreted 
by CD4* T cells and mediates recruitment of effector CD8* T cells to 
the infected tissue via the chemokine receptor CXCR3 (ref. 4). CKCR3 
is expressed by both effector T-helper 1 (Ty1) cells and activated 
CD8°~ T cells, as well as other cell types’. Thus, we proposed that 
the topical application of chemokines CXCL9 and CXCL10 might 
be sufficient to recruit effector T cells to the vagina in the absence of 
infection. To test this hypothesis we used T-cell antigen receptor 


transgenic CD8" T cells that recognize an epitope within the HSV 
glycoprotein B (gBT-I)’° to track the HSV-2 specific CD8 T-cell popu- 
lation. Naive female C57BL/6 mice were transplanted with 10° conge- 
nically marked gBT-I CD8™ T cells and immunized subcutaneously 
with an attenuated strain of HSV-2 that lacks thymidine kinase (TK ~ 
HSV-2)"' (Fig. 1a). As expected, this route of immunization resulted 
in minimal migration of activated CD8* T cells into the vagina 
(Fig. 1b, c). To recruit or ‘pull’ activated HSV-specific CD8"* T cells, 
the chemokines CXCL9 and CXCL10 were topically applied to the 
vaginal cavity of subcutaneously immunized mice (Fig. 1a). Another 
group of mice was immunized intravaginally with TK HSV-2, which 
served as a positive control for maximal CD8* T-cell recruitment to 
the vagina (Fig. 1b, c). At day 6 post infection, all three treatment 
groups exhibited primary CD8* T-cell responses of similar magni- 
tudes, as indicated by the numbers and percentages of systemic gBT-I 
CD8"* T cells found in the spleen (Fig. 1b, c). However, the number 
and percentage of gBT-I CD8™ T cells in the vagina were significantly 
higher in mice treated with the chemokine pull (subcutaneous 
immunization plus pull) compared to the control subcutaneously 
immunized mice (Fig. 1b, c). Furthermore, the action of the chemo- 
kine pull was restricted to the genital mucosa, as gBT-I CD8* T-cell 
recruitment to the vagina-draining iliac lymph nodes was limited 
(Fig. 1c). Activated CD4* T cells were also strongly recruited to the 
vagina by the chemokine pull (Fig. 1d). Antigen in the vagina was not 
responsible for the recruitment, as HSV-2 genomic DNA was absent 
from the genital tract after subcutaneous immunization (Sup- 
plementary Fig. 1). To mimic a vaccination scenario more closely, 
we also tested whether endogenous virus-specific T cells could be 
recruited by prime and pull. Like gBT-I CD8* T cells, the systemic 
endogenous HSV-specific CD8* T-cell response was similar in all 
immunized groups (Supplementary Fig. 2a). However, significantly 
greater numbers of HSV-specific CD8* T cells as well as CD4* T cells 
were present in the genital tracts of mice treated with the chemokine 
pull as compared to subcutaneous immunization alone (Supplemen- 
tary Fig. 2). Thus, these data show that the prime and pull method is 
capable of recruiting a large number of parenterally primed T cells to 
the genital tract with a single topical application of chemokines. 

To assess the possible inflammatory consequences of topical che- 
mokine application to the vagina, we examined the presence of innate 
inflammatory cells after the chemokine pull. Other cell types, includ- 
ing natural killer cells and plasmacytoid dendritic cells (pDCs) express 
CXCR3 (ref. 9). However, no significant increase in the number of 
pDCs, natural killer cells, granulocytes, dendritic cells, monocytes, 
macrophages and monocyte-derived dendritic cells was elicited by 
the chemokine treatment (subcutaneously immunized plus pull) com- 
pared to the subcutaneously immunized control (Supplementary Fig. 
3). These data indicated that topical chemokines do not induce appre- 
ciable recruitment of natural killer cells or pDCs to the vagina and that 
effector T cells are selectively recruited during prime and pull without 
inducing a general inflammatory response. 

During genital HSV infection, CD4” T cells act as a pioneering popu- 
lation for the migration of virus-specific CD8* T cells by inducing the 
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Figure 1 | Effector T cells are recruited to the vagina by topical chemokine 
treatment. a, Experimental schematic. Donor gBT-I CD8* T-cell recipients 
were not immunized (naive), or were immunized either intravaginally (ivag) or 
subcutaneously (s.c.) with TK” HSV-2. Five days post infection, 
subcutaneously immunized mice were treated vaginally with either the 
chemokines CXCL9 and CXCL10 (pull) or PBS. b, The frequency of donor 
gBT-I CD8" T cells 1 day post pull in the indicated tissues (ILN, iliac lymph 
nodes). Plots are gated on total CD8~ T cells and numbers indicate per cent 
gBT-I (CD45.1*). c, The number of donor gBT-I CD8" T cells 1 day post pull 


production of critical chemokines within the tissue*. To determine 
whether the recruitment of gBT-I CD8* T cells to the genital tract 
was similarly dependent on CD4* T-cell help during prime and pull, 
subcutaneously immunized mice were injected with a CD4-depleting 
antibody on day 3 and day 5 post infection to preserve normal 
CD8* T-cell priming’’, and then treated with the chemokine pull 
(Supplementary Fig. 4a). In CD4~ T-cell-depleted mice (Supplemen- 
tary Fig. 4b), both systemic gBT-I CD8" T-cell numbers and migration 
to the vagina were unaffected (Supplementary Fig. 4c, d), indicating 
that recruitment of effector CD8™ T cells to the vagina after chemokine 
treatment bypasses the requirement for CD4* T-cell help. 

CXCR3 is upregulated on T cells upon activation and remains high 
through the effector and memory stages’. Having demonstrated that 
CXCL9 and CXCL10 could recruit CXCR3” effector T cells to the 
vagina, we next examined the efficacy of the chemokine pull at different 
stages of T-cell priming. After subcutaneous TK” HSV-2 immuni- 
zation, CXKCR3 was upregulated on both gBT-I CD8* T cells and 
CD4* T cells throughout the response (Fig. 2a), suggesting that both 
effector and memory T cells should be capable of responding to the 
chemokine pull. Previous reports have shown that early effector CD8* 
T cells had an increased ability to migrate to peripheral tissues’, so we 
next determined whether the timing of chemokine pull dictated the 
efficacy of T-cell recruitment to the genital tract. When subcutaneously 
immunized mice were treated with the chemokine pull at the effector 
(day 5), contraction (day 15) and memory (day 28) phase of the T-cell 
response’, we found that the chemokine pull was most effective at 
recruiting antigen-specific CD8* T cells during the effector (day 5) 
phase, which correlated with the increased number and percentage of 
systemic gBT-I CD8* T cells (Fig. 2b). Despite similar CXCR3 expression 
(Fig. 2a), memory gBT-I CD8" T cells were not present in the tissue 
after pull when treated during the memory phase (day 28) (Fig. 2b). We 
speculate that this might be due to altered homing patterns”’* and 
the reduced number and percentage of gBT-I CD8* T cells in circula- 
tion at the memory time point. Recruitment of CD44* CD4* T cells 
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in the indicated tissues. d, The number of CD4* T cells 1 day post pull in the 
vagina. c, d, Numbers in graphs indicate fold difference in T-cell number for 
subcutaneously immunized compared with subcutaneously immunized plus 
pull. Statistical significance was determined by two-tailed unpaired Student’s 
t-test. Data are pooled from 2-7 independent experiments (n = 6-21 per 
group). All error bars show s.e.m. and *P = 0.05-0.01, **P = 0.01-0.001, 
***P < 0.001 throughout all figures (individual P values are found in the 
Supplementary Information). 
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Figure 2 | Chemokine pull is specific for highly activated effector T cells. 
Mice were subcutaneously immunized and given chemokines or PBS at day 5, 
15 or 28 post infection and analysed 1 day post pull. a, CKCR3 expression on 
donor gBT-I CD8* T cells or CD44*CD4" T cells from the spleen 1 day post 
pull from subcutaneously immunized mice (open black) and subcutaneously 
immunized plus pull (open red). Shaded histograms are CD44'°CD8* or total 
CD4* T cells. b, The gBT-I CD8™~ T cell number in the vagina (left) or spleen 
(middle) and frequency in the spleen (right) were examined 1 day post pull. 
c, The number of CD44*CD4" T cells in the vagina 1 day post pull on the 
indicated days post infection. d, The number of endogenous CD44*CD8* T 
cells in the vagina 1 day post pull on the indicated days post infection. 

b-d, Statistical significance was determined by two-tailed unpaired Student’s 
t-test. Data are pooled from 2-3 independent experiments; n = 6-18 per group. 
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(Fig. 2c) and endogenous CD8°* T cells (Fig. 2d) followed a similar 
pattern. Collectively, these data indicate that the chemokine pull is most 
effective at recruiting recently activated effector CD8* T cells that are 
circulating at high frequency, establishing a specific time frame within 
which the chemokine pull should be administered after priming. 

For the prime and pull approach to be an effective vaccine strategy, 
pathogen-specific T cells must be retained within the tissue for an 
extended time and establish a pool of memory cells. To determine 
whether the effector gBT-I CD8" T cells recruited into the vagina after 
prime and pull were capable of establishing a long-term population of 
memory CD8T cells, we examined the presence of gBT-I CD8"* T cells 
4 weeks after the chemokine pull. The number of systemic memory 
gBT-I CD8"* T cells, although decreased compared to day 1 post pull 
(Fig. 1c) due to contraction of the T-cell response, was similar regard- 
less of immunization route or treatment (Fig. 3a). However, a signifi- 
cantly greater number and percentage of memory gBT-I CD8™ T cells 
was present in the genital tract of subcutaneously immunized mice 
treated with the chemokine (subcutaneously immunized plus pull) 
compared to chemokine untreated mice (subcutaneously immunized) 
(Fig. 3a, b). Despite significant recruitment during the effector phase 
(Fig. 1d), CD4* T cells were not retained within the vagina long term 
(Fig. 3c), reminiscent of CD4* T-cell behaviour after dermal HSV-1 
infection in which the cells leave the site of infection to mediate immu- 
nosurveillance!®. Thus, CD4* T cells may require additional signals, 
such as those generated during HSV-2 infection'*”, to be retained long 
term within the vagina. 

To investigate the stability of this tissue-resident population of 
memory gBT-I CD8* T cells, we also examined T-cell numbers at 
12 weeks post pull. Donor gBT-I CD8* T-cell numbers in the vagina 
were significantly higher after prime and pull than after subcutaneous 
immunization alone (Fig. 3d). Furthermore, the number of memory 
gBT-I CD8* T cells did not decline between 4 weeks and 12 weeks 
(Fig. 3d), suggesting that this tissue-resident population was stable and 
retained long term. CD4* T-cell numbers in the vagina remained low 
at week 12 after prime and pull, and were comparable to numbers 
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detected at week 4 (Fig. 3e). Thus, a single chemokine pull given to 
mice during the effector phase is sufficient to establish a long-term 
population of tissue-resident memory CD8°™ T cells, but not CD4* T 
cells, within the vagina. 

Tissue-resident memory T cells are effective in mediating immunity 
against local infections”’. HSV-2 spreads from its initial replication site 
at the epithelium to the innervating neurons, and subsequently estab- 
lishes latency within the dorsal root ganglia (DRG)’. Reactivation from 
latency leads to viral shedding and formation of genital lesions that are 
commonly associated with genital herpes’. Thus, preventing the 
spread of virus from the mucosal epithelium to the DRG is key in 
preventing disease and transmission of the virus. As a single chemo- 
kine pull administered after subcutaneous immunization is capable of 
establishing a population of tissue-resident memory CD8* T cells 
within the vagina long term, we next examined whether the prime 
and pull strategy would provide enhanced immunity against genital 
HSV-2 infection. Mice were challenged intravaginally with a lethal 
dose of wild-type HSV-2 4 weeks after prime and pull and monitored 
for disease and survival. Notably, mice treated with the chemokine pull 
(subcutaneously immunized plus pull) lost significantly less weight 
than either the non-immunized or subcutaneously immunized controls 
(Fig. 4a). Furthermore, prime and pull almost completely prevented 
the development of clinical symptoms, which were observed in both 
non-immunized and subcutaneously immunized controls (Fig. 4b). 
Accordingly, mice treated with the chemokine pull had a 100% sur- 
vival rate compared to the 36.3% survival rate of the subcutaneously 
immunized control (Fig. 4c). Upon challenge with wild-type HSV-2 
4weeks post pull, mice immunized and chemokine-treated in the 
absence of T-cell antigen receptor transgenic CD8* T cells were also 
significantly protected from weight loss (Supplementary Fig. 5a) and 
clinical disease (Supplementary Fig. 5b), although we did not observe a 
significant difference in survival rate (Supplementary Fig. 5c). Anti- HSV 
antibody titres in the vagina were not significantly different between 
subcutaneously immunized controls and chemokine-treated mice (Sup- 
plementary Fig. 6), suggesting that the control of viral challenge was 
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Figure 3 | Virus-specific T cells recruited by chemokine pull are retained in 
the vagina long term. a, Mice were immunized and treated as shown in Fig. la. 
At 4 weeks post pull, numbers of gBT-I CD8~ T cells were determined in the 
indicated tissues. The number inside the graph shows fold difference in gBT-I 
number between subcutaneously immunized compared to subcutaneously 
immunized plus pull groups. b, Four weeks post pull, the frequency of gBT-I 
cells was measured in the vagina. Plots are gated on total CD8* T cells. 
Numbers show per cent gBT-I (CD45.1" ).c, The number of endogenous CD4* 
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T cells in the vagina at 4 weeks post pull. d, The numbers of gBT-I cells were 
determined in the vagina at 12 weeks post pull (left) and compared to numbers 
at 4 weeks post pull (right). Number * ND, number of animals with no cells 
detected in the tissue. e, The numbers of CD4* T cells were determined in the 
vagina at 12 weeks post pull (left) and compared to the corresponding numbers 
at 4 weeks post pull (right). Statistical significance was determined by two-tailed 
unpaired Student’s t-test. NS, not significant. Data are pooled from 2-3 
independent experiments; n = 4-15 per group). 
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Figure 4 | Prime and pull protects mice from lethal genital HSV-2 challenge. 
a, Weight loss in mice immunized as shown in Fig. 1a and then challenged 
vaginally with a lethal dose of HSV-2 4 weeks post pull. b, Disease severity in 
mice immunized as shown in Fig. 1a and then challenged vaginally with a lethal 
dose of HSV-2 four weeks post pull. A higher disease score indicates more 
severe disease symptoms. c, Survival in mice immunized as shown in Fig. la and 
then challenged vaginally with a lethal dose of HSV-2 4 weeks post pull. 

d, HSV-2 viral titres from vaginal washes collected at the indicated time points 
post challenge with HSV-2. Dashed line indicates limit of detection, none 
detected. n = 11 (non-immunized), n = 9 (intravaginal immunization), n = 12 
(subcutaneously immunized control, subcutaneously immunized plus pull). 
e, Viral titres were measured in the dorsal root ganglia 6-7 days post challenge. 
Number X ND, number of mice in which no virus was detected. n = 6-11 per 
group. Statistical significance was measured by two-way ANOVA (a, b, d), log- 
rank (Mantel-Cox) test (c) or two-tailed unpaired Student’s t-test (e). Data are 
pooled from 3-5 independent experiments. 


probably T-cell mediated. These results demonstrate that the addition of 
a chemokine pull to parenteral immunization could greatly enhance 
protective immunity against genital HSV-2 infection. 

To test more stringently the long-term protection afforded by prime 
and pull, we also challenged mice 10-12 weeks post pull. At this late 
time point, the prime and pull group lost less weight compared to 
subcutaneously immunized controls (Supplementary Fig. 7a), and 
were significantly protected from development of disease (Supplemen- 
tary Fig. 7b). Furthermore, at 2 weeks post challenge, the prime and 
pull group had a survival rate of 100%, whereas subcutaneously immu- 
nized controls had a survival rate of 57% (Supplementary Fig. 7c). 
Thus, our results show that the protection provided by prime and pull 
lasts over time and remains robust up to 12 weeks after chemokine 
treatment. 

To determine the mechanism by which the prime and pull strategy 
mediates protection from HSV-2 disease, we measured viral replica- 
tion within the genital mucosa. Notably, we found no difference in 
virus titres from the vaginal secretion of subcutaneously immunized 
versus chemokine-treated (subcutaneously immunized plus pull) mice 
(Fig. 4d and Supplementary Figs. 5d and 74d), indicating that protection 
was probably being mediated at a different location. As the more severe 
symptoms of clinical disease in mice are associated with viral replica- 
tion in the peripheral nervous system'*, we next investigated whether 
prime and pull could protect the DRG against infection. When viral 
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replication within the DRG was measured, we found that mice treated 
with the chemokine pull had significantly lower virus titres than non- 
immunized mice (Fig. 4e). Furthermore, viral titres in the DRG of 
the prime and pull group were significantly lower than that of sub- 
cutaneously immunized mice (Fig. 4e). Together, these data indicate 
that prime and pull strategy greatly reduces disease by controlling 
neuronal infection with HSV-2 rather than by controlling mucosal 
viral replication. 

Our study demonstrates that after conventional vaccination to 
generate a systemic T-cell population (prime), a single topical treat- 
ment with chemokines applied vaginally (pull) can provide superior 
protection against genital herpes by decreasing the spread of virus 
from the mucosal epithelia into the neurons. Importantly, protection 
of neurons from HSV-2 infection by prime and pull may decrease 
reactivation and viral shedding, which may reduce disease and trans- 
mission. Although the exact role of T cells in controlling neuronal 
HSV-2 infection after prime and pull is not yet clear, we speculate that 
the local HSV-specific T cells may help to control entry of virus at the 
neuronal endings, or promote blockade of viral replication once inside 
the neurons. Furthermore, other studies have demonstrated that T 
cells recruited to the genital tract by inflammation alone can decrease 
viral replication at the mucosal surface’’, suggesting that control of 
infection at the site of entry may be possible by optimizing prime and 
pull. Thus, in addition to preventing reactivation of latent HSV”, 
virus-specific memory T cells may be mobilized to control neuronal 
viral infection during primary infection. Although topical application 
in the genital tract of Toll-like receptor ligands such as imiquimod 
have been shown to be effective as a therapeutic approach”’, they may 
not be ideal vaccine candidates as they seem to be effective for only a 
short time after application and function through the induction of pro- 
inflammatory cytokines”. The novel prime and pull vaccine strategy 
described here provides an alternative to direct immunization of the 
genital tract, and establishes robust, long-term immunity with minimal 
local inflammation. 

Cellular immunity is critical in mediating protection against viral 
STIs such as HSV-2 and HIV-1 (ref. 23). Both viruses enter through 
the genital mucosa, begin local replication and then spread to other 
tissues”. Although our data highlight the role of prime and pull in 
controlling viral spread to the peripheral nervous system, this vaccina- 
tion approach is not necessarily restricted to neurotropic viruses. HIV- 
1 enters the genital mucosa and invades the draining lymph node, from 
which systemic dissemination of the virus occurs”’, In its current form, 
prime and pull establishes tissue-resident memory CD8~ T cells but 
not CD4" T cells. Given that a single HIV-1 virion can establish infec- 
tion in humans”, local memory CD8" T cells may be key to protection 
against HIV-1 (ref. 23) by reducing replication and dissemination of 
the founder virus, while the absence of local CD4* T cells could limit 
the availability of immediate target cells. Beyond viral infections, prime 
and pull could be applied to improve recruitment of immune cells to 
other restrictive microenvironments such as solid tumours. Effective 
immunotherapy can be hindered by either decreased or inappropriate 
expression of chemokines at the tumour tissue, leading to minimal 
migration of immune cells™*. Delivery of appropriate chemokines to 
the tumour tissue after immunization could enhance recruitment of 
tumour-specific T cells and augment the efficacy of immunotherapies. 
Although the protocol we present pairs the pull with a suboptimal 
subcutaneous immunization, we propose that the prime and pull strat- 
egy could be used in conjunction with more effective immunizations? 
to enhance protection. Ultimately, the ability to boost recruitment of T 
cells and establish resident T-cell populations in peripheral tissues 
restrictive for lymphocyte homing will aid not only in the prevention 
but also in the treatment of a wide variety of diseases. 


METHODS SUMMARY 


Adoptive transfers, infections and T-cell depletion. CD8* T cells (10°) from 
the spleens of naive CD45.1' gBT-I TCR transgenic mice’? were adoptively 
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transferred into Depo-Provera (GE Healthcare) treated”, naive 6-week-old 
C57BL/6 recipients (National Cancer Institute). Recipients were immunized intra- 
vaginally or subcutaneously in the neck ruff with 10° or 10° plaque forming units 
(PFU) of 186TKAkpn HSV-2 (TK HSV-2)!', respectively. Some mice were 
treated twice with 200 1g anti-CD4 antibody (GK1.5) intraperitoneally to deplete 
CD4" T cells. Five days post-immunization, subcutaneously immunized mice 
were vaginally swabbed with a Calginate swab (Fisher) and either PBS or a solution 
of CXCL9 and CXCL10 (3 pig each, Peprotech) in PBS was delivered via pipette tip 
into the vagina. For 4-week challenges, mice were infected intravaginally with 
5,000 PFU of wild-type HSV-2 186 syn+ (ref. 26). For 10-12 week challenges, 
mice were treated again with Depo-Provera 9-10 weeks before challenge. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Mice. Female 6-week-old C57BL/6 mice were purchased from the National 
Cancer Institute. gBT-I T cell antigen receptor (TCR) transgenic mice specific 
for the glycoprotein B epitope gB(498-505) were provided by F. R. Carbone and 
W. R. Heath and bred in our facility to C57BL/6-Ly5.2Cr mice (CD45.1*) 
(National Cancer Institute). All procedures used in this study complied with 
federal and institutional policies of the Yale Animal Care and Use committee. 
Adoptive transfers and infections. Spleens were collected from naive CD45.1~ 
gBT-I TCR transgenic mice and CD8" T cells were magnetically purified by CD80 
microbeads or CD8«+ T-cell isolation kits (Miltenyi Biotec). Donor cells (10°) 
gBT-I CD8* T cells were adoptively transferred into Depo-Provera-treated (GE 
Healthcare), 7-8-week-old C57BL/6 recipients retro-orbitally. Mice were then 
immunized intravaginally or subcutaneously with 10° or 10° plaque forming units 
(PFU) of 186TKAkpn HSV-2 (TK HSV-2) respectively. At 5 days post-infection, 
the vaginal cavity of mice was swabbed with a Calginate swab (Fisher) and either 
PBS or a solution of CXCL9 and CXCL10 (3 1g each, Peprotech) in PBS was 
delivered via pipette tip into the vagina. Where indicated, C57BL/6 mice that 
did not receive gBT-I cells were primed and pulled in a similar manner. Some 
subcutaneously immunized mice were intraperitoneally injected with 200 jug anti- 
CD4 (GK1.5) antibody at day 3 and 5 post infection to deplete CD4* T cells. For 
the 4-week challenge, non-immunized or previously immunized mice at the indi- 
cated time points were infected intravaginally with 5,000 PFU wild-type HSV-2 
186 syn+. Challenges given at 10-12 weeks post pull were treated with Depo- 
Provera for a second time 1-2 weeks post pull (9-10 weeks before challenge) 
before infection with 5,000 PFU wild-type HSV-2 186 syn+. 

Flow cytometry. At various time points, single cell suspensions from the spleen, 
lungs, vagina and iliac lymph nodes were prepared for analysis as described’. Briefly, 
lungs were digested with collagenase D (Roche). Vaginas were treated with Dispase 
II (Roche) for 15 min and then collagenase D for 30 min. Cells from the spleen and 
iliac lymph node were counted by haemocytometer. Lung and vagina cell numbers 
were quantified using CountBright absolute counting beads (Invitrogen). Dead cells 
were excluded from analysis using the LIVE/DEAD Fixable Aqua Dead Cell stain kit 
(Invitrogen). All samples were acquired on an LSRII equipped with a 532-nm green 
laser (BD Biosciences). All data were analysed with FlowJo (Treestar). 
Antibodies. The following antibodies were used for this study: CD3 (17A2), CD8 
(53-6.7), CD44 (1M7), CD45.1 (A20), CXCR3 (CXCR3-173), CD11c (N418), 
CD11b (M1/70), MHC class II (M5/114.15.2), Ly6G (1A8), F4/80 (BM8), B220 
(RA3-6B2), CD19 (ebio1D3) and NK1.1 (PD136) (Biolegend); Ly6C (AL-21)(BD 
Biosciences); CD4 (RM4-4) (eBioscience); and CD4 (RM4-5) (Biolegend and 
Invitrogen). H-2K?-gBayg_sos tetramer was obtained from the National Institutes 
of Health tetramer core facility. 


Measurement of viral titres, weight and disease scores. Vaginal secretions were 
collected 5 days post challenge using PBS and Calginate swabs. Lumbar and sacral 
dorsal root ganglia (DRG) were collected at days 6-7 post challenge as described”. 
DRG were homogenized using a motorized pestle (VWR). Titres from vaginal and 
DRG samples were measured on Vero cell monolayers as previously described”. 
Weight loss was measured daily and normalized to body weight on day 0 of 
challenge. Disease was monitored daily and scored as follows: (0) no disease; (1) 
genital inflammation; (2) genital lesions and hair loss; (3) hunched posture and 
ruffled fur; (4) hind limb paralysis; and (5) premoribund”. Mice were euthanized 
before reaching the moribund state due to humane concerns. 

Detection of HSV-2 antigen by quantitative PCR. Mice were immunized sub- 
cutaneously or intravaginally and were killed at day 5 post infection. Vaginal tissue 
was collected and genomic DNA was extracted as previously described”. Briefly, 
tissue was homogenized in a salt homogenizing buffer using a motorized pestle. 
Proteinase K and SDS were added to samples and incubated overnight at 55 °C. 
After addition of a sodium chloride solution, samples were centrifuged and super- 
natants were transferred to new tubes. Isopropanol was added to the supernatants 
and incubated at 20°C for 1h. DNA was pelleted by centrifugation, washed with 
ethanol and resuspended in H.O. HSV-2 was measured with primers detecting 
glycoprotein B (gB) (Forward: 5'-AGACCAGGGCCGCTGATC-3’; reverse: 5’- 
GCGCTGGACCTCCGTGTAG-3’) with quantitative polymerase chain reaction 
(Stratagene). DNA purified from TK HSV-2 was used as standard to calculate 
PFU equivalents. 

Measurement of HSV-specific antibody titres. Vaginal secretions were collected 
from mice with PBS and Calginate swabs 4 weeks post pull. HSV-specific 
immunoglobulin-G (IgG) was measured by ELISA assay as previously described”. 
Known quantities of anti-HSV gB monoclonal antibody (SS10 mouse IgG) kindly 
provided by G. Cohen and R. Eisenberg was used as a standard. 


27. alin, S.A., Davis, B.M. & Molliver, D.C. Production of dissociated sensory neuron 
cultures and considerations for their use in studying neuronal function and 
plasticity. Nature Protocols 2, 152-160 (2007). 
28. orrison, L. A., Da Costa, X. J. & Knipe, D. M. Influence of mucosal and 
parenteral immunization with a replication-defective mutant of HSV-2 on 
immune responses and protection from genital challenge. Virology 243, 
178-187 (1998). 
29. Aljanabi, S. M. & Martinez, |. Universal and rapid salt-extraction of high quality 
genomic DNA for PCR-based techniques. Nucleic Acids Res. 25, 4692-4693 
(1997). 
30. Soderberg, K. A., Linehan, M. M., Ruddle, N. H. & lwasaki, A. MAdCAM-1 
expressing sacral lymph node in the lymphotoxin B-deficient mouse provides a 
site for immune generation following vaginal herpes simplex virus-2 infection. 
J. Immunol. 173, 1908-1913 (2004). 
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Calcium ions (Ca**) have an important role as secondary messen- 
gers in numerous signal transduction processes’ *, and cells invest 
much energy in controlling and maintaining a steep gradient 
between intracellular (~0.1-micromolar) and extracellular (~2- 
millimolar) Ca?* concentrations!. Calmodulin-stimulated calcium 
pumps, which include the plasma-membrane Ca**-ATPases 
(PMCAs), are key regulators of intracellular Ca”* in eukaryotes**. 
They contain a unique amino- or carboxy-terminal regulatory 
domain responsible for autoinhibition, and binding of calcium- 
loaded calmodulin to this domain releases autoinhibition and acti- 
vates the pump. However, the structural basis for the activation 
mechanism is unknown and a key remaining question is how cal- 
modulin-mediated PMCA regulation can cover both basal Ca?* 
levels in the nanomolar range as well as micromolar-range Ca** 
transients generated by cell stimulation’. Here we present an inte- 
grated study combining the determination of the high-resolution 
crystal structure of a PMCA regulatory-domain/calmodulin com- 
plex with in vivo characterization and biochemical, biophysical and 
bioinformatics data that provide mechanistic insights into a two- 
step PMCA activation mechanism mediated by calcium-loaded cal- 
modulin. The structure shows the entire PMCA regulatory domain 
and reveals an unexpected 2:1 stoichiometry with two calcium- 
loaded calmodulin molecules binding to different sites on a long 
helix. A multifaceted characterization of the role of both sites leads 
to a general structural model for calmodulin-mediated regulation of 
PMCAs that allows stringent, highly responsive control of intracel- 
lular calcium in eukaryotes, making it possible to maintain a stable, 
basal level at a threshold Ca”* concentration, where steep activation 
occurs. 

Compared with other P-type ATPases, calmodulin-stimulated cal- 
cium pumps contain an additional autoinhibitory or regulatory (R) 
domain at the N terminus’ (in plants) or C terminus'® ’* (in mammals) 
(Supplementary Fig. 1). Pump activity is stimulated by binding of 
calcium-loaded calmodulin (Ca**-CaM) to the regulatory domain, 
thereby inducing a conformational change that displaces the autoin- 
hibitory domain from the pump core structure’**. Calmodulin (CaM) 
substantially increases both Ca" affinity and pump rate in PMCAs'. 
To investigate the mechanism of CaM-mediated PMCA regulation, we 
have determined the structure of an intact regulatory domain (residues 
40-95) of the Arabidopsis thaliana PMCA (we denote this domain 
Aca8R) in a homologous complex with A. thaliana Cam7 and char- 
acterized it in vivo and in vitro. 

The crystal structure of this complex, determined at a resolution of 
1.95 A, shows an overall dumbbell shape and reveals an unexpected 2:1 
stoichiometry (Fig. la and Supplementary Table 1): Aca8R forms a 56- 
residue-long «-helix with two Ca**-Cam7 molecules bound at two 
distinct CaM-binding sites within the autoinhibitory domain in a 


pseudo-symmetrically ‘inverted’ arrangement. Given this structure, 
we denote the complex (Cam7),—Aca8R. The two binding sites are 
separated in space by eight residues with no interaction between the 
two CaM molecules. Although the first binding site (CaMBS1) had 
been previously mapped by mutational studies’, the second binding 
site (CaMBS2) was unknown and unexpected. 

Both sites show typical structural characteristics of CaM-target 
recognition’®. The hydrophobic anchor points are Trp 47 and Phe 60 
in CaMBS1 and Ile 79 and Phe 92 in CaMBS2. Each anchor residue is 
docked in a hydrophobic cage formed in each lobe of Cam7 (Fig. 1b-d 
and Supplementary Figs 2 and 3). In both of the binding sites the 
anchor points show a 1-14 spacing, whereas in CaM complexes of 
human PMCA4b” and plant BCA1"* a 1-18 spacing of anchor resi- 
dues has been observed. A superimposition of the two bound CaM 
molecules reveals that the molecule bound to CaMBS1 has a slightly 
more closed conformation. The long «-helix formed by the regulatory 
domain of Aca8 shows a minor bending in CaMBS1. 

Small-angle X-ray scattering (SAXS) data of the purified Cam7- 
Aca8R complex in solution agree with the crystal structure of the 2:1 
complex. The distance distribution function shows two separated 
peaks indicating the presence of two subdomains (Supplementary 
Fig. 4), and the ab initio SAXS model superimposes with the crystal 
structure (Supplementary Fig. 4). In contrast, a putative 1:1 Cam7- 
Aca8R binding model does not fit the experimental SAXS data, indi- 
cating that the 2:1 (Cam7),—-Aca8R complex is indeed the favoured, 
stable complex in solution. 

Using isothermal titration calorimetry (ITC), we measured CaM 
binding to the isolated binding sites as well as to a construct containing 
the entire autoinhibitory/regulatory domain of the protein Aca8. 
Cam7 binds tightly to peptides corresponding to CaMBS1 (dissoci- 
ation constant, Kgy=13nM) and CaMBS2 (Kyg=0.5uM) of Aca8 
(Fig. le, fand Supplementary Table 2). When measuring CaM binding 
to a construct comprising the entire autoinhibitory/regulatory domain 
(Aca8(40-126)), biphasic binding to two CaM-binding sites was 
observed with affinities similar to those of binding to individual pep- 
tides (Fig. 1g), whereas apo-CaM (stripped using EDTA) did not bind 
to any construct (data not shown). This indicates that the regulatory 
domain of Aca8 indeed contains two independent sites that bind 
Ca**-CaM with different, but physiologically relevant, affinities. 

We performed functional complementation assays using a Ca” - 
ATPase-deficient yeast strain (K616'°”°). This allowed us to investigate 
the autoinhibitory function of the R domain from PMCA gain-of- 
function mutations, because heterologous expression of a constitu- 
tively active Ca** pump in the endoplasmic reticulum of the yeast is 
required to maintain internal calcium stores on growth in EGTA- 
containing medium (see Supplementary Information for a detailed 
account of this model). Indeed, without galactose-driven expression 
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Figure 1 | Overall structure of the A. thaliana (Cam7),-Aca8R complex. 

a, Representation with CaM molecules in dark green (CaMBS1) and dark blue 
(CaMBS2), and Aca8R in orange, light green (CaMBS1) and cyan (CaMBS2). 
Ca?* is shown in magenta. b, c, Detailed view of hydrophobic anchor residues 
involved in CaM-binding in CaMBS1 (b) and CaMBS2 (c). Cam7 is shown as 
ribbon. Residues in Cam7 that are within van der Waals bonding distance of the 
Aca8R anchor residues (red sticks) are shown as sticks with yellow surfaces. 
d, Electron density illustrating the extended o-helix (containing both 


of Aca8 variants, none of the K616 yeast grows in Ca” ’-depleted 
medium (Fig. 2a, SD medium). However, expression of a constitutively 
active Ca~” pump by virtue of truncation of either one or both CaM- 
binding sites (A74Aca8 or A100Aca8) complements the lack of endo- 
genous yeast Ca**-ATPases and allows growth in the Ca~*-depleted 
medium (Fig. 2a, SG medium), whereas the wild-type autoinhibited 
Aca8 does not. Yeasts with anchor-point mutations in Aca8 CaMBS1 
(Trp47Ala and Phe60Ala) but not in CaMBS2 (Phe92Ala) show mod- 
erate growth, indicating that displacement of CaMBS1 alone generates 
basal pump activity. The Trp47AlaPhe92Ala double mutant with 
mutation in both CaM-binding sites shows significantly better growth 
(Fig. 2a) than the Trp47Ala mutant, indicating further involvement of 
anchor point Phe92Ala (CaMBS2) in autoinhibition. Expression levels 
were similar for all Aca8 constructs (Supplementary Fig. 5). The results 
confirm that both CaM-binding sites contribute to autoinhibition of 
the pump. 

The Ca**-stimulated ATPase activity of wild-type Aca8 further 
shows that the pump encompasses both a high- and a low-affinity 
sensor for Ca?" (via Ca**-CaM; Fig. 2b). Wild-type Aca8 is inactive 
(autoinhibited) in the absence of CaM and active in its presence. The 
A74Aca8 truncated form, lacking the high-affinity CaM-binding site, 
shows basal activity and can be further activated twofold by CaM, 
whereas the A100Aca8 form, which lacks both CaM-binding sites, shows 
CaM-independent activity (Fig. 2b). Furthermore, the characterization 
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CaM-binding sites) formed by the autoinhibitory region. e-g, Binding of CaM 
to the autoinhibitory domain of Aca8, examined by ITC. Typical ITC data are 
shown for the binding of Cam7 to 24-mer peptides corresponding to CaMBS1 
(e) and CaMBS82 (f) of Aca8 and to Aca8(40-126) containing both CaM- 
binding sites (g). Upper panels: raw heat pulse data. Lower panels: integrated 
heat pulses, normalized per mole of injectant, giving differential binding curves 
that are adequately described by a one-site (e and f) or two-site (g) binding 
model with Ky = 13nM for CaMBS1 and Kg = 500 nM for CaMBS2. 


of the different Aca8 mutants supports the proposal that both CaM- 
binding sites contribute to autoinhibition as well as to Ca”*-CaM- 
mediated activation of the pump (Fig. 2c). 

The presence of two CaM-binding sites is, however, not unique to 
the A. thaliana Aca8 protein. Sequence analysis suggests that both 
CaM-binding sites are conserved in other Arabidopsis Aca isoforms 
and plant PMCA homologues (Supplementary Fig. 6). Even more 
surprisingly, our analysis of mammalian PMCA (PMCAI to 
PMCA4) sequences indicates that tandem CaM-binding sites are gen- 
erated through alternative splicing in several variants (Fig. 3). The 
autoinhibitory domain of mammalian PMCAs is C-terminal and sub- 
ject to tissue- and cell-type-specific alternative splicing’”’. PMCA 
splice variants show not only differences with respect to inhibition 
and CaM activation but also in response to different Ca?* signals’, 
supporting the idea that alternative splicing might be a general mech- 
anism for the fine-tuning of intracellular signalling’. Binding 
experiments by ITC using peptides of PMCA splice variants 
(PMCA1 to PMCA4) confirmed that CaM binds to their second 
sites with Kg values in the low-micromolar range (Fig. 3a—d and 
Supplementary Table 2), indicating that several mammalian PMCA 
variants also interact with two CaM molecules. The spacing between 
the CaM-binding sites is comparable to that in Aca8, suggesting a 
similar 2:1 structural arrangement and a similar mechanism of two- 
step, Ca”*-dependent activation by CaM. 
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Figure 2 | Physiological and biochemical characterization of wild-type Aca8 
and mutants. a, Functional complementation of the Ca?*-ATPase-deficient 

yeast strain (K616) expressing wild-type and mutants of Aca8. Yeast strains K601 
(wild type) and K616 transformed with empty vector are used as positive and 

negative controls, respectively. K616 transformed with wild-type and mutants of 
Aca8 regulated by a galactose-inducible promoter are grown on galactose (SG) or 
glucose (SD) medium supplemented with either 10 mM CaCl, (Ca?*) or 10mM 
EGTA. Only deregulated pumps are able to complement the lack of endogenous 
yeast Ca?*-ATPases in Ca** -depleted medium (SG EGTA). b, Ca?*-stimulated 
ATPase activity of wild-type Aca8 and truncation variants lacking either CAMBS1 
(A74Aca8) or both CaM-binding sites (A100Aca8) in the presence or absence of 
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10 1M Cam7. The asterisk denotes the concentration of free Ca** measured by 
the Ca”*-sensitive dye fura-2 (4.80 1M), which gave rise to the greatest difference 
between the activities with and without CaM and was used for normalization 
(each construct was normalized separately to allow direct comparison of the effect 
of CaM) and further characterization. The plot shows the results of one of two 
independent experiments carried out in triplicate. Errors, s.d. c, Top: ATPase 
activity of wild-type and mutant Aca8. The ratio of CaM-independent activity 
(Ca’* with no Cam7) is given relative to the Ca** -CaM-dependent activity (Ca** 
with 10 4M Cam7). Errors, s.d. (2-4 independent experiments in triplicate). 
Bottom: schematic of the two CaM-binding sites; the position of anchor point 
residues and the parts removed in the truncated versions of Aca8 are indicated. 
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Figure 3 | Binding of CaM to the putative second CaM-binding site of 
PMCA splice variants. a—d, Typical ITC data for the binding of mammalian 
CaM to 24-mer peptides corresponding to splice variants of PMCA1 (a), PMCA2 
(b), PMCA3 (c), and PMCA4 (d). Upper panels: raw heat pulse data. Lower 
panels: integrated heat pulses, normalized per mole of injectant, giving differential 
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binding curves that are adequately described by a one-site binding model. 

e, Control peptide lacking the hydrophobic anchor residues of PMCA4d (Val4Ala, 
Leu6Ala and Phe19Ala), illustrating that CaM binding is specific. f, Alignment of 
Aca8 and mammalian PMCA splice variants coloured according to the CLUSTAL 
colouring scheme. The two CaM-binding sites are enclosed with boxes. 
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Figure 4 | Schematic of the proposed two-step, Ca”*-mediated CaM- 
activation mechanism. With increasing Ca’* concentration, Ca”* -CaM first 
binds to and displaces high-affinity CaMBS1 before even higher Ca** 
concentration leads to displacement of CaMBS2 from the catalytic core, 


Using mathematical network modelling of two-binding-site Ca** - 
CaM activation of PMCA, we find that the pump system is ready for 
steep activation above a basal Ca”* concentration, meaning that the 
enzyme is inactive below this level and rapidly activates as soon as 
concentrations exceed it (Supplementary Fig. 7). This gives our results 
particular physiological relevance, because Ca” activation through 
just one or no CaM-binding or autoinhibitory site (assuming other- 
wise identical conditions) is much less abrupt, meaning that a minimal 
cellular Ca** concentration would be less stable and acceleration of 
pump activity at increased Ca" levels less pronounced. 

By combining structural, physiological and biochemical data with 
modelling, we also derive a structural model for Aca8 in its autoinhib- 
ited state. Aca8 homology models in Ca’ *-bound (E1) and Ca’*-free 
(E2) conformations reveal the presence of a highly conserved cleft, 
situated between the A domain and the N and P domains, that is fully 
exposed in the E2 conformation and buried in the El conformation 
(Supplementary Video 1 and Supplementary Figs 8 and 9). Several 
acidic residues previously reported to be involved in autoinhibition 
(Glu 252, Asp 303 and Asp 332) cluster along this region® (Sup- 
plementary Fig. 9). The N-terminal autoinhibitory domain of Aca8, 
however, contains several conserved basic residues'® (Arg 58, Arg 61, 
Lys 67 and Lys 68), which are of critical importance for autoinhibition 
and could potentially interact with the acidic residues mentioned 
above (Supplementary Fig. 8). We suggest that as the Ca** concen- 
tration increases, Ca’ *-CaM first binds to and displaces the high- 
affinity CaMBS1 in the region between the A and N domains, and 
that further Ca**-CaM activation leads to displacement of CaMBS2 
from the linker regions between the transmembrane domain and the 
mobile A domain (Fig. 4 and Supplementary Fig. 8). Displacement of 
CaMBS1 allows free movement of the cytoplasmic domains as 
required for ion pumping at basal levels, and the additional displace- 
ment of CaMBS2 by higher Ca” *-CaM concentrations or the presence 
of acidic phospholipids”® results in further activity. This model pro- 
vides a conceptual framework for a bimodular, Ca’*-mediated CaM- 
activation mechanism that allows regulation of intracellular Ca** con- 
centration over a broad range of physiological conditions, ready to 
respond promptly at increased Ca~" levels and setting the basal 
Ca** level in the cell. Such a mechanism seems to be well suited to 
the fine-tuning of calcium homeostasis and intracellular signalling in 
eukaryotes. 


METHODS SUMMARY 


Procedures for the expression, purification and crystallization of the (Cam7)2- 
Aca8R complex were similar to those described in ref. 27. The complex was 
crystallized at 4 °C using sitting-drop vapour diffusion against a reservoir contain- 
ing 1.9 M (NH4)2SOy, 0.1 M CAPS (pH 10.5) and 0.2 M Li,SO,. The crystals have 
space group P4,2,2 and unit-cell parameters a = b = 71.25 A, c = 163.28 A and 


Basal activated state 


LETTER 


Fully activated state 


allowing free movement of the A domain as required for ion pumping. Actuator 
(A), nucleotide-binding (N) and phosphorylation (P) domains and the 
transmembrane region are indicated. Aca8R is shown as cylinder with the 
CaM-binding site colour-coded as in Fig. 1. 


a = B = y = 90°. Diffraction data were collected to a resolution limit of 1.95 A at 
the 1D23-2 beamline of the European Synchrotron Radiation Facility (France). 
The structure was determined by molecular replacement followed by iterative 
model building and refinement. The final model yielded a crystallographic 
R-factor of 22.1% and a free R-factor of 25.3%. Evaluation of the Ra- 
machandran plot showed all residues except one (99.7%) in allowed regions 
(97.0% in favoured regions). SAXS data were collected at the X33 beamline at 
EMBL/DESY (Germany), following standard procedures. Heterologous express- 
ion, yeast complementation tests, membrane purification and activity assays of 
full-length Aca8, N-terminal truncation and single-point-mutation constructs 
were performed essentially as previously described'***. More detailed methods 
can be found in Method and Supplementary Information. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Purification and crystallization of (Cam7),-Aca8R. Purification of the complex 
and initial crystallization was performed as previously described”. In brief, 
Aca8(40-95) and Cam7 (both from A. thaliana) were co-expressed in Escherichia 
coli C41 cells” and purified using standard His-tag purification protocols followed 
by TEV protease digestion, a second Ni-affinity chromatography step to separate a 
fusion protein tag and a final gel-filtration step. Initial crystals were obtained using 
the vapour diffusion technique in sitting drops prepared by mixing 1.5 ul of 
(Cam7),—Aca8R solution (16 mg ml ~ ') with 1 ul of reservoir solution, and equili- 
brated against reservoir solution containing 2.0M (NHy4)2SO,, 0.1M CAPS 
(pH 10.5) and 0.2M Li,SO, (final pH 8.2) at room temperature (293 K). These 
crystals belonged to space group C2 and diffracted to a resolution of 3.0 A (ref. 
27). Better-diffracting crystals belonging to a different space group (P4,2)2) were 
obtained at 4 °C under slightly modified conditions: 1 pil of (Cam7).-Aca8R solu- 
tion (16 mg ml’) was mixed with 1 ull of reservoir solution and equilibrated against 
reservoir solution containing 1.9 M (NH,4)2SOu, 0.1 M CAPS (pH 10.5) and 0.2M 
Li,SO, at 4 °C (277 K). Crystals appeared after several months, grew to maximum 
dimensions of 0.7 mm X 0.35 mm X 0.2 mm within weeks (Supplementary Fig. 10) 
and proved to be superior to the room-temperature crystal with respect to diffrac- 
tion properties. Crystals were mounted and flash-cooled in liquid nitrogen without 
additional cryoprotection. 

Characterization of CaM-binding sites using complementation tests and 
activity assays. Cam7 from A. thaliana and mammalian CaM alone were 
expressed in E. coli C41 and purified using standard His-tag purification protocols. 
Heterologous expression, yeast complementation tests, membrane purification 
and activity assays of full-length Aca8, N-terminal truncation and single-point- 
mutation constructs were performed essentially as previously described'*”*. The 
yeast strain K616 lacks the regulatory subunit B (Cnb1) of calcineurin in addition 
to the two Ca”*-ATPases (Pmcl and Pmrl). Under normal conditions, calci- 
neurin inhibits the low-affinity vacuolar Ca”*/H* antiporter Vcx1. Vex is the 
only transporter that can fill up the vacuolar Ca’* store in the mutant lacking the 
two Ca**-ATPases. At high Ca** concentration in the cytosol, Vcx1 transports 
Ca’* into the vacuole. Therefore, to get a viable yeast strain, the cnb1 mutation 
is required. The mutation of calcineurin activates the antiporter, which then 
removes Ca”* from the cytosol. At low Ca?* concentration, the K616 strain needs 
an active high-affinity transporter (Ca**-ATPase) to remove Ca’* from the cyto- 
sol; that is, complementation requires a constitutively active Ca** pump in the 
endoplasmic reticulum/Golgi apparatus apparently to scavenge trace Ca** for 
proper functioning of the secretory pathway. It has previously been shown that 
only expression of active Ca**-ATPases can rescue the growth defect in Ca**- 
depleted medium'». The growth rate of the mutant in Ca”* -depleted medium thus 
provides an in vivo assay for functional characterization of overexpressed recom- 
binant Ca** pumps from heterologous sources. 

The expression level of Aca8 and mutants in yeast microsomal membranes 

(20 ug) were determined by SDS-polyacrylamide gel electrophoresis. The bio- 
chemical characterization was carried out on microsomal membranes (5 jg). 
ATPase activity was determined with and without 10 11M Cam7 by varying the 
free Ca** concentration. The free Ca”* concentrations were determined using the 
Ca’**-sensitive dye fura-2 (ref. 30). Calibration curves were generated using 
the Calcium Calibration Buffer Kit #1 (zero and 10 mM CaEGTA) and fura-2 
pentapotassium salt (Invitrogen/Molecular Probes). Each buffer and ATPase assay 
sample (prepared as for the ATPase activity assay) was supplemented with 1 1M 
fura-2. For each calibration buffer and test sample, excitation scans were generated 
between wavelengths 250 and 450 nm (slit, 5 nm) while detecting the emission at 
wavelength 510 nm (slit, 5 nm) essentially as described in the provided protocol 
from Invitrogen. The HORIBA Jobin Yone FluoroMax-4 spectrofluorometer and 
the software FLUORESSENCE were used for the measurements. For the final 
calculations, fluorescence at 380 nm (F380) was used. The free Ca?* concentration 
in the calibration kit buffer was calculated using the specific Ka (EGTA) values at 
pH 7.3 (30°C). The addition of Cam7 in the absence of Ca”* did not result in any 
change in ATPase activity. Activity without Ca”* and Cam7 were subtracted from 
those with Ca** and with and without Cam7 to give the measured Ca”*-ATPase 
activity. 
Data collection and X-ray diffraction analysis. Diffraction data were collected to 
a resolution limit of 1.95 A. Two full data sets containing 200 oscillation images 
each with an interval of 1° were collected using the same crystal at a wavelength of 
0.9464 A at the ID23-2 beamline of the European Synchrotron Radiation Facility 
(France). Exposure times and crystal-to-detector distances were respectively 0.4 s 
and 472 mm (set one) and 0.5 s and 245 mm (set two). Low-resolution and high- 
resolution data sets were merged using XSCALE”. 

Reflections were indexed and scaled with XDS"*'. The crystals belonged to the 
tetragonal space group P4,2;2 and had unit-cell parameters a= b =71.25A, 
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c= 163.28A and «=f =y=90°. A summary of the data statistics is given in 
Supplementary Table 1. 
Structure determination. The structure was solved by molecular replacement 
using PHASER” with a systematic combination of search models (derived from 
various CaM complexes) and search parameters. An initial molecular replacement 
solution contained two molecules of CaM N-terminal domain and one molecule of 
CaM C-terminal domain and revealed clear extra density that could be unam- 
biguously assigned to Aca8R. Subsequent rounds of manual building using 
COOT” and refinement using PHENIX.REFINE™ allowed placement of the sec- 
ond CaM C-terminal domain and led to almost complete model building (five 
residues (76-80) in a flexible linker connecting Cam7 lobes bound to CaMBS2 
could not be traced). The final model yielded a crystallographic R-factor of 22.1% 
and a free R-factor of 25.3%. The model was validated using MOLPROBITY**”*. 
Evaluation of the Ramachandran plot showed all residues except one (99.7%) in 
allowed regions (97.0% in favoured regions), with Ser 82 situated in the flexible 
loop connecting the N- and C-terminal lobes of Cam7 (CaMBS1) as the single 
outlier. All figures were prepared using PYMOL”. The data are deposited in the 
Protein Data Bank. 
SAXS — data collection and ab initio modelling. SAXS data were collected at the 
X33 beamline at EMBL/DESY (Germany), following standard procedures. 
Repetitive data collection on the same sample was performed and no radiation 
damage was detected. Samples of (Cam7),—Aca8R solution were prepared in 
concentrations ranging from 4.5 to 9.0mgml * in a buffer containing 40 mM 
Tris/HCl (pH 7.4), 150 mM NaCl, 5 mM f-mercaptoethanol and 5 mM CaCl. All 
SAXS data were analysed using the package ATSAS**. Raw data were processed 
using PRIMUS”. The radius of gyration (R,) was evaluated using the Guinier 
approximation (J(s) = I(O)exp(—s*R,”/ 3) for sR, < 1.3, where I denotes intensity 
and s denotes momentum transfer) and also from the entire scattering curve with 
the program GNOM" the latter also provided the distance distribution function, 
p(r), and the maximum dimension, D,yax. The masses of the solutes were evaluated 
by comparison of the forward scattering intensity with that from a BSA reference 
solution (mass, 66 kDa). 

Low-resolution SAXS models were obtained using the ab initio simulated 
annealing program DAMMIN”, which generates models consisting of dummy 
atoms to fit the experimental data I.,,,(s) by minimizing the discrepancy 


2 1 Texp(Sj) = CTcatc(Sj) ? 
N-1 = ( o(s)) 


j 


Xx 


where Nis the number of experimental points, cis a scaling factor, and I.a-(s;) and 
o(s;) are the calculated intensity and the experimental error at the momentum 
transfer s;, respectively. Superimposition of low-resolution dummy atom models 
was performed using SUPCOMB”. 

Isothermal titration calorimetry. The thermodynamic parameters were deter- 
mined using an isothermal titration calorimeter (MicroCal ITC, GE 
Healthcare) at 15°C in a buffer containing 40 mM Tris/HCl (pH 7.4), 150 mM 
NaCl, 5 mM f-mercaptoethanol, 5 mM CaCl. For binding of Cam7 to Aca8(40- 
126) 400 1M Cam7 was titrated into the sample cell containing 20 4M Aca8(40- 
126). For binding of Cam7 to peptides corresponding to Aca8 CaMBS1 (residues 
42-65) and Aca8 CaMBS2 (residues 74-97) typically 150-400 LM of the peptide 
was titrated into the sample cell containing 7-40 1M Cam7. Reverse titration 
yielded identical thermodynamic parameters. For binding of mammalian CaM 
to peptides corresponding to the putative second CaM-binding site of mammalian 
PMCA1, PMCA2, PMCA3 and PMCA4 splice variants typically 0.6-1.2 mM CaM 
was titrated into the sample cell containing 50-100 uM peptide. Injection steps 
were 1.5 il (first injection, 0.3 pl) with 150-s spacing. Baseline corrections were 
performed by titrating protein or peptide into sample buffer and sample buffer 
into protein or peptide. Further data evaluation (Supplementary Table 2) was done 
using the MicroCal ORIGIN program. Peptides were purchased from GL Biochem 
(China; HPLC purified, >90% purity). 

The peptide sequences were as follows: Aca8-CaMBS1: ERLQQWRKA 
ALVLNASRRFRYTLD; Aca8-CaMBS2: EMRQKIRSHAHALLAANRFMDMGR; 
PMCA1-CaMBS2: HHDVTNISTPTHIRVVNAEFRSSLY; PMCA2-CaMBS2: SQD 
VANLSSPSRIRVVKAFRSSLY; PMCA3-CaMBS2: LHD VTNLSTPTHIRV VKAFR 
SSLY; PMCA4-CaMBS2: HLDVKLVPSSSYIKVVKAFHSSLH; PMCA4-CaMBS2- 
minus-anchor: HLDAKAVPSSSYIKV VKAAHSSLH. 

Bioinformatics analysis and homology modelling. Multiple sequence alignments 
were produced using CLUSTAL W*. A sequence motif was generated using 
WEBLOGO™. Homology models of Aca8 in the El and E2 conformations were 
computed with MODELLER* using the SERCA structures (Protein Data Bank IDs, 
3N5K and 3N8G) as templates and a manually optimized alignment as input. 
Conservation scores were calculated with CONSURF*. Docking of the autoinhibitory 
N terminus onto the Aca8 homology model was performed with HADDOCK** 
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using residues previously observed to be involved in autoinhibition’>** (Arg 58, Arg 61, 
Lys 67, Lys 68, Glu 252, Asp 303 and Asp332) as restraints. 

Mathematical modelling. We denote free CaM by Cp and CaM in complex with 
four Ca** ions by C4. The autoinhibited wild-type pump is denoted Pg, whereas P, 
and P, denote the pumps with one and, respectively, two CaM-binding sites 
occupied by C, (X, and X, in the Ca”* -bound state). Finally, P; denotes inorganic 
phosphorus. The wild-type model is represented by the following set of reactions: 


A 
Co +4Ca?* == Cy 
u 
mK 
Pp + Cy =P 
0 


my 
+C,——P,, 
1 


P 


ou cath ng v1 2+ 
P, +Ca’ —— X, —— P; + Ca +P; 
TM 


P)+Ca?t =X, —* 5 P) + Ca?t +P, 
72 
Assuming mass-action kinetics, the reactions give rise to a set of polynomial 
differential equations. Supplementary Fig. 7 shows a typical output obtained using 
the reaction rates 2= 10, W=5, % =10, By = 0.1, a, =1, 6) = 0.25, o, =1, 
tT, = 0.001, y; = 0.01, o2 = 2, t2 = 0.001 and y. = 0.1 and with total amount of 
CaM equal to 10 and the total amount of PMCA equal to 1. For varying total 
amounts of Ca?", the steady-state concentrations are obtained from numerical 
simulation of the system of differential equations using MATHEMATICA. The 
steady-state production rate of inorganic phosphorus 


d[Pi] 
dt 
where square brackets denote concentration, is used as a measure of pump activity 
and plotted as a function of the steady-state concentration of free calcium. The 
ACaMBS1 and ACaMBS1+2 models are derived from the wild-type model by 
leaving out Po and, respectively, Py and P, as well as all reactions in which the 
omitted pump or pumps participate. All simulations were started with 
[Ca?*] = Ca?* 4, and [Co] =C,, and with all PMCAs in the state with no 
CaM bound. That is, in states Po, Py and P2 for wild-type, ACaMBS1 and 
ACaMBS1+ 2, respectively. 
We note that the models are nested: all parameters in the ACaMBS1+2 and 
ACaMBS1 models are also parameters in the ACaMBS1 and, respectively, wild- 
type models. Simulations were made assuming that % > a, and o2 > 0), meaning 


=7[Xi] +72[X2] 


that the binding affinity of CaM to CaMBS1 was higher than that of CaM to 
CaMBS2 and that the activity of P, was higher than that of P;, respectively. 
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Endothelial cell expression of haemoglobin «a 
regulates nitric oxide signalling 


Adam C. Straub', Alexander W. Lohman’, Marie Billaud’?, Scott R. Johnstone’, Scott T. Dwyer’, Monica Y. Lee’, 
Pamela Schoppee Bortz!, Angela K. Best!, Linda Columbus*, Benjamin Gaston? & Brant E. Isakson!? 


Models of unregulated nitric oxide (NO) diffusion do not consistently 
account for the biochemistry of NO synthase (NOS)-dependent sig- 
nalling in many cell systems'*. For example, endothelial NOS con- 
trols blood pressure, blood flow and oxygen delivery through its effect 
on vascular smooth muscle tone’, but the regulation of these processes 
is not adequately explained by simple NO diffusion from endothelium 
to smooth muscle**. Here we report a new model for the regulation of 
NO signalling by demonstrating that haemoglobin (Hb) a (encoded 
by the HBA1 and HBA2 genes in humans) is expressed in human and 
mouse arterial endothelial cells and enriched at the myoendothelial 
junction, where it regulates the effects of NO on vascular reactivity. 
Notably, this function is unique to Hb a and is abrogated by its genetic 
depletion. Mechanistically, endothelial Hb o haem iron in the Fe** 
state permits NO signalling, and this signalling is shut off when Hb a 
is reduced to the Fe** state by endothelial cytochrome b5 reductase 3 
(CYB5R3, also known as diaphorase 1)°. Genetic and pharmacological 
inhibition of CYB5R3 increases NO bioactivity in small arteries. 
These data reveal a new mechanism by which the regulation of the 
intracellular Hb a oxidation state controls NOS signalling in non- 
erythroid cells. This model may be relevant to haem-containing 
globins in a broad range of NOS-containing somatic cells”. 

Endothelial NOS modulates blood vessel diameter in response to 
both vasodilators and vasoconstrictors. For example, it is known that 
during arterial constriction, NO from endothelium feeds back on 
smooth muscle to control the magnitude of the response to a vasocon- 
strictor (for example, phenylephrine)*'*. Phenylephrine stimulation of 
thoracodorsal arteries ex vivo—and of primary human endothelial 
cells (ECs) and vascular smooth muscle cells (SMCs) in the vascular 
cell co-culture (VCCC) model—reproduced classical NOS- and cyclic 
guanosine monophosphate (cGMP)-dependent changes in SMC 
biology (Supplementary Fig. la-d). However, NO did not diffuse into 
the extracellular space (Supplementary Fig. le-h), consistent with our 
previous work showing compartmentalized NOS signalling at the 
myoendothelial junction (MEJ), the crucial EC-SMC contact point 
in the thoracodorsal arteries and other small arteries and arterioles’. 
Therefore, we studied MEJ proteins that could contribute to local 
regulation of NO diffusion and biochemistry. We performed a proteo- 
mic analysis of MEJs isolated from VCCCs using the isobaric tags for 
relative and absolute quantification (1TRAQ) system (Supplementary 
Fig. 2). Surprisingly, Hb « was abundant at the MEJ (Supplementary 
Fig. 3). Because Hb can regulate NO diffusion and biochemistry in 
erythrocytes'*"*, we proposed that it could have a similar function at 
the MEJ. 

First, we confirmed the proteomic data using immunoblot and 
immunofluorescence. We demonstrated Hb protein expression in 
the VCCC model, but found no expression of Hb f (Fig. 1a). There was 
little Hb « expression in human ECs or SMCs grown separately, and 
there was no Hb « in the fibronectin or gelatin used to coat the VCCC 
transwells (Fig. 1a). Next, we confirmed these results in co-cultures of 


different types of EC and SMC in which MEJs also expressed abundant 
Hb « (Supplementary Fig. 4). We then studied the ME) distribution 
of Hb & in situ. Gold particles labelling Hb % were abundant in the MEJ 
of mouse thoracodorsal arteries visualized by transmission electron 
microscopy (TEM) (Fig. 1b). By contrast, carotid arteries (conduit 
arteries that have few MEJs) expressed little Hb « as observed by 
TEM (Fig. 1b), immunoblot (Fig. 1c) and immunofluorescence 
(Fig. 1d). These data were consistent in human skeletal muscle arter- 
ioles (Fig. 1d) and throughout several tissue beds (Supplementary 
Fig. 5). Using en face immunofluorescence, we found punctuate Hb 
a staining primarily at paracellular junctions of thoracodorsal, but not 
carotid, arteries, whereas little Hb B was observed (Fig. le). Chemical 
crosslinking analysis showed that the Hb % was monomeric in thor- 
acodorsal arteries and in VCCCs (Fig. 1f). Next, we measured Hb « 
messenger RNA using real-time PCR (Fig. 1g), and established that 
ECs transfected with Hb o short interfering RNA (siRNA) had 
decreased protein expression at the MEJ (Supplementary Fig. 6a) 
and in the endothelial monolayer (Supplementary Fig. 6b). Loss of 
Hb « protein expression did not affect endothelial NOS (eNOS) 
expression in the EC monolayer (Supplementary Fig. 6b) or at the 
ME] (Supplementary Fig. 7). 

Transcripts for other globins including myoglobin, neuroglobin and 
cytoglobin were absent in ECs (Supplementary Fig. 8a—c). Only cyto- 
globin mRNA and protein were expressed in SMCs (Supplementary 
Fig. 8c, d), consistent with a previous report''. In addition, we also 
found Hb «-stabilizing protein in the endothelium of thoracodorsal 
arteries and in VCCCs (Supplementary Fig. 9a, b). Taken together, 
these data show for the first time, to our knowledge, that arterial ECs 
express Hb « mRNA and protein, and are responsible for enriched Hb 
o expression at the ME). 

To investigate the functional role of Hb « in ECs and its effect on 
eNOS signalling, we transfected ECs in isolated thoracodorsal arteries 
with Hb « or control siRNA. Knockdown efficiency was 70-80% (Sup- 
plementary Fig. 10). Loss of Hb & resulted in a marked loss in arterial 
reactivity after phenylephrine application in a single or cumulative doses 
(Fig. 2a, b) and increased reactivity to acetylcholine (ACh) (Fig. 2c), 
but there was no change in the response to 5-hydroxytryptamine 
(5-HT) (Supplementary Table 1). Half-maximum effective concentra- 
tion (ECs9) and maximum effect (E,,ax) values are listed in Supplemen- 
tary Table 2. We observed no difference in basal tone (Supplementary 
Fig. 1la). However, after the addition of the NOS inhibitor L-N«- 
nitroarginine methyl ester (L-NAME), the effect of Hb « siRNA 
was comparable to control conditions for both the phenylephrine 
and ACh responses (Fig. 2a-c). We thus proposed that eNOS, the 
primary isoform in the vessel wall, may be in close spatial proximity 
to Hb «. We tested this hypothesis using four methods: co-localization 
studies by immunofluorescence (Fig. 2d, g), a proximity ligation assay 
(Fig. 2e), and co-immunoprecipitations from both cell lysates (Fig. 2f h) 
and purified proteins (Fig. 2i). These analyses showed that Hb « 
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Figure 1 | Monomeric Hb @ is expressed in ECs and enriched at the MEJ. 
a, Quantitative western blot analysis for Hb & and Hb f expression in coronary 
EC, ME] or SMC lysates from cells plated on Transwells or plastic (n = 4), or on 
fibronectin and gelatin used to coat Transwells. Red blood cells (RBCs) served 
as a positive control and GAPDH was used as a loading and normalization 
control for quantification (bottom left). Immunofluorescence for Hb « (red) 
and Hb (green) (bottom right). Ctrl, control. b, TEM analysis of Hb « 
expression in mouse thoracodorsal arteries (TDA; top) or carotid arteries 
(bottom), visualized using 10-nm gold beads (black particles). Arrow indicates 
ME); L denotes lumen. Graphs on the right represent Hb « localization 
calculated by measuring the number of beads per 10 um” (n = 7). 

c, Quantitative western blot analysis for Hb « and Hb f expression in isolated 
thoracodorsal or carotid arteries. Tubulin served as a loading control and red 
blood cells were used as a positive control (n = 3). d, Immunofluorescence of 
transverse sections from mouse carotid or thoracodorsal arteries or from a 


and eNOS are in a macromolecular complex and can form a direct 
protein-protein interaction. 

Hb « probably interacts with eNOS to regulate blood vessel tone by 
controlling NO diffusion through its scavenging by haem iron’*!7-”. 
We studied the mechanism of interaction by measuring loss of NO 
radical in thoracodorsal and carotid arteries, and in the VCCC model. 
NO was lost in thoracodorsal arteries, but not carotid arteries; it was 
also lost in MEJ fractions, but not in EC or SMC lysates (Supplemen- 
tary Fig. 12a, b). Next, we knocked down endothelial Hb « in isolated 
arteries (Fig. 2) or VCCCs (Fig. 21) using siRNA. Loss of Hb & increased 
NO diffusion across the vessel wall (Fig. 2k) and in VCCCs (Fig. 2m). 
Together, these results indicate that endothelial Hb « can regulate 
arterial tone through its effects on NO diffusion. 

Next, we proposed that Hb « haem iron in the oxygenated Fe’ * state 
should control NO diffusion through a fast reaction (2.4 10°M ‘s ‘)” 
resulting in dioxygenation’'’’, whereas Fe’* state should permit 
NO diffusion owing to a slower reaction rate (3.3 X 10°M's1)?3, 
We found that Hb « haem iron resides in both states. First, using 
ultraviolet-visible spectroscopy, we identified a Soret peak (~420 nm) 
and Q bands (~540-575 nm) in isolated thoracodorsal arteries consist- 
ent with oxygen-bound Hb Fe**, whereas there was no peak in carotid 
arteries (Fig. 3a). Next, we measured the oxidation state of Fe and 
found approximately 42% existed in the Fe7* and 58% in the Fe** 
state (Fig. 3b). These measurements were sensitive to Hb « siRNA 
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human skeletal muscle arteriole. In all images, red indicates Hb « expression, 
green shows internal elastic lamina (IEL) autofluorescence, and blue specifies 
nuclei. e, En face images of Hb & (red) and Hb (green) expression in ECs from 
thoracodorsal or carotid arteries. Blue staining represents nuclei. f, Western 
blot analysis of thoracodorsal artery, EC, MEJ, SMC or red blood cell lysates 
that were chemically crosslinked using BS; (bis(sulphosuccinimidyl)suberate) 
to determine the quaternary structure of Hb a. g, mRNA analysis from EC, MEJ 
and SMC lysates isolated from VCCCs, thoracodorsal and carotid arteries. 18S 
was used as a normalization factor. In a-c and g open bars represent in vitro 
data and striped bars indicate ex vivo data. Scale bars, 2 um (a), 0.5 um 

(b), 30 um (d, thoracodorsal artery and carotid) and 10 tm (d, human skeletal 
muscle artery, e). NS, not significant. P values are shown for each comparison, 
by analysis of variance (ANOVA) with Bonferroni post-analysis or Student’s 
t-test. All error bars represent s.e.m. 
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(Fig. 3b). Consistent with this observation, we found that carbon 
monoxide (CO) ligated Fe** haem, resulting in increased NO diffusion 
across isolated vessels (Supplementary Fig. 12c). When ME] fractions 
were studied, we found a Soret peak (~410 nm) characteristic of the 
Fe** state (methaemoglobin) (Fig. 3c). Interestingly, pelleted mem- 
branes from MEJ fractions were dark brown, consistent with Fe?t 
oxidation (Supplementary Fig. 13). We found approximately 32% of 
Fe existed in the Fe?~ state and 68% in the Fe** state (Fig. 3d), results 
that were also sensitive to Hb « siRNA (Fig. 3d). In addition, we 
observed an increase in NO diffusion in VCCCs treated with CO 
(Supplementary Fig. 12d). 

Previous work has demonstrated that the NO-haem Fe’ ~ interaction 
results in reductive nitrosylation, a mechanism known to generate 
S-nitrosothiols, which we have shown to be crucial for gap-junction 
regulation at the MEJ****. Using N-acetylcysteine as a bait reactant 
on the abluminal side (Supplementary Fig. 14a, c), we also found a 
notable loss of S-nitrosothiol synthesis after Hb « knockdown in thora- 
codorsal arteries (Supplementary Fig. 14b) and in VCCCs (Sup- 
plementary Fig. 14d). Together, these results suggest that the Hb « 
haem oxidation state regulates both NO diffusion and bioactivation. 

Next we determined the mechanism regulating the Hb « oxidation 
state. In erythrocytes, CYB5R3, a known methaemoglobin reduc- 
tase, controls the haem iron oxidation state through the reduction of 
Fe** (ref. 6). Using immunofluoresence (Fig. 4a, e (in vivo, in vitro, 
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Figure 2 | Hb a regulates vessel tone, NO 
diffusion and associates with eNOS. a-c, Time 
course effect of 50 1M phenylephrine (a), a dose 
naae response to phenylephrine (b) and a dose response 
tat to ACh (c) on thoracodorsal arteries treated with 
control or Hb siRNA in the presence or absence of 
L-NAME. n indicates the number of arteries; value 
in parenthesis shows number of mice. Asterisk (*) 
shows significance between control siRNA and Hb « 
siRNA; hash symbol (#) indicates significance 
between Hb « siRNA and Hb & siRNA plus 
L-NAME; dagger (+) represents significance 
between control and control plus L-NAME. d, En 
face view of a dual immunofluorescence of a mouse 
thoracodorsal artery showing Hb « (red) and eNOS 
(green). The white box in the merge panel indicates 
the region of interest magnified in the far-right 
panel. e, Proximity ligation assay for Hb « and eNOS 
(red punctates) in transverse mouse thoracodorsal 
artery sections. Inset shows the negative control. 

f, Western blot (WB) analysis from samples co- 
immunoprecipitated (IP) for Hb « and probed for 
eNOS from isolated thoracodorsal and carotid 
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immunoprecipitation of Hb « and eNOS expression 
in VCCC lysates. i, Co-immunoprecipitation of 
purified Flag-eNOS protein probed for Hb «. 

j, Schematic diagram of experimental design 
illustrating a cannulated vessel transfected with Hb « 
siRNA showing NO diffusion as a readout. k, NO 
diffusion results from mouse thoracodorsal arteries 
transfected with control or Hb & siRNA (n= 5). 

1, Illustration of experimental set up for VCCC 
experiments. m, NO diffusion results from VCCCs 
transfected with control or Hb « siRNA (n= 4). 
Striped bars in k represent ex vivo data; open bars in 
m indicate in vitro data. Scale bars, 10 ttm (d, e) and 
1 um (g). P values are shown for each comparison, 
by ANOVA with Bonferroni post-analysis. All error 
bars represent s.e.m. 


respectively)), TEM (Fig. 4b), and western blot analysis (Fig. 4c, d 
(in vivo, in vitro, respectively)), we identified that CYB5R3 was 
expressed in ECs and at the MEJ. In addition, we established 
CYB5R3 is in a complex with Hb « using four separate assays: immu- 
nofluorescence (Fig. 4f, g), proximity ligation assay (Fig. 4h), and co- 
immunoprecipitation from cell lysates and purified proteins (Fig. 4i). 
Indeed, molecular modelling of the crystal structures for Hb «, eNOS 
and CYB5R3 revealed a discrete region of high probability where the 
proteins could interact (Supplementary Fig. 15). Next, we used human 
CYB5R3 siRNA (knockdown efficiency: ~50%; Supplementary Fig. 16a) 
and CYB5R3 overexpression to show that CYB5R3 regulates methae- 
moglobin « reduction. Time-lapse ultraviolet-visible spectrometry 
demonstrated that the loss of CYB5R3 protein inhibited methaemo- 
globin « reduction and that overexpression enhanced methaemoglo- 
bin « reduction (Supplementary Fig. 16b, c). To determine whether 
CYB5R3 expression or activity regulates arterial tone, we tested both 
siRNA directed against endothelial CYB5R3 in thoracodorsal arteries 


Figure 3 | The oxidation state of Hb a resides in a mixture of Fe”* and Fe**. 
a, c, Ultraviolet-visible spectroscopy analysis of thoracodorsal arteries (a) and 
VCCC fractions (c). The inset in a indicates the region of interest (magenta box) 
of the Soret (~420 nm) and Q bands (~540-575 nm). a.u., arbitrary units. 

b, d, Measurement of the Hb « oxidation state calculating the ratio of Fe?t 
Fe** in thoracodorsal arteries (b; n = 3) and VCCC fractions (d; n = 3) with 
and without Hb « siRNA. Striped bars in b indicate ex vivo data; open bars in 
d represent in vitro data. K;[Fe(CN)g], potassium ferricyanide. P values are 
indicated for each comparison, by Student’s t-test. All error bars represent s.e.m. 
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Figure 4 | CYB5R3 expression and activity are crucial for vasomotor tone 
and NO diffusion. a, Immunofluorescence of CYB5R3 expression (red) and 
nuclei (blue). Green represents autofluorescence from internal elastic lamina. 
b, TEM analysis of CYB5R3 expression at the MEJ (black particles) in vivo. 
c, d, Western blot analysis of CYB5R3 in thoracodorsal arteries (c) and VCCCs 
(d). e, Immunofluorescence of CYB5R3 expression in VCCCs. Red 

shows CYB5R3 and green indicates F-actin. f, En face view of a dual 
immunofluorescence labelling of a mouse thoracodorsal artery showing Hb a 
(red) and CYB5R3 (green) in the top panels. White box in the merge image in 
the bottom left panel shows the region of interest magnified in the bottom right 
panel. g, Colocalization of CYB5R3 (red) and Hb «& (green) on a transverse 
section from the VCCC. h, Proximity ligation assay for Hb « and CYB5R3 (red 
punctates) on transverse mouse thoracodorsal artery sections. Inset shows the 
negative control. Green shows internal elastic lamina autofluorescence. 

i, Western blot analysis from samples co-immunoprecipitated for Hb « and 
probed for CYB5R3 from isolated thoracodorsal and carotid arteries, VCCCs or 
purified proteins. j-I, Time course effects of 50 uM phenylephrine (j), a dose 


anda pharmacological inhibitor of CYB5R3, propylthiouracil (PTU)”*. 
Knockdown efficiency using mouse Cyb5r3 siRNA was about 70% 
(Supplementary Fig. 17a). We observed a decrease in arterial reactivity 
in thoracodorsal arteries transfected with CYB5R3 siRNA after pheny- 
lephrine stimulation with a single dose or cumulative concentrations 
(Fig. 4j, k), and increased arterial reactivity with increasing ACh doses 
(Fig. 41). Vascular reactivity to phenylephrine or ACh in thoracodorsal 
arteries pretreated with PTU is shown in Supplementary Fig. 18a-c. 
The PTU effect was not reversible with L-thyroxine supplementation 
after phenylephrine stimulation (Supplementary Fig. 18b, inset). We 
found no change in the PTU effect after 5-HT addition (Supplemen- 
tary Table 1). ECs9 and Ey,ax values are in Supplementary Table 2. 
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response to phenylephrine (k) and a dose response to ACh (I) on thoracodorsal 
arteries treated with control or Hb o siRNA in the presence or absence of 
L-NAME. n indicates the number of arteries; value in parenthesis shows 
number of mice. Asterisk (*) shows significance between control siRNA and 
CYB5R3 siRNA; hash symbol (#) indicates significance between CYB5R3 
siRNA and CYB5R3 siRNA plus L-NAME; dagger (+) represents significance 
between control and control plus L-NAME. m, Schematic of experimental set 
up for NO diffusion assay in a cannulated artery that was transfected with 
CYB5R3 siRNA. n, Results from NO diffusion experiment in mouse 
thoracodorsal arteries with genetic knockdown of CYB5R3 expression (n = 3). 
o, Illustration showing the experimental set up for VCCC experiments. p, NO 
diffusion results from VCCCs transfected with control or CYB5R3 siRNA 

(n = 4). Striped bars in n indicate ex vivo data; open bars in p represent in vitro 
data. Scale bars, 10 um (a, f, h), 0.25 tum (b), 5 jum (e) and 1 jum (g). P values are 
indicated for each comparison, by ANOVA with Bonferroni post-analysis. All 
error bars represent s.e.m. 


However, after the addition of L-NAME, the effects of CYB5R3 
siRNA (Fig. 4j-l) or of PTU treatment (Supplementary Fig. 18a, b) 
were comparable to control conditions, results that were consistent 
with Hb o knockdown. We found no difference in basal tone for 
CYB5R3 siRNA or PTU (Supplementary Fig. 11a, b). Next, we tested 
the effect of CYB5R3 on NO diffusion in vessels and VCCCs (Fig. 4m, 
0). Knockdown efficiency with CYB5R3 siRNA was ~30% at the MEJ 
(Supplementary Fig. 17b) and in the EC monolayer, but not in SMCs 
(Supplementary Fig. 17c). Both CYB5R3 siRNA and PTU treatment 
increased NO diffusion across isolated vessels and in VCCCs (Fig. 4n, p 
and Supplementary Fig. 18d—g). Note that CYB5R3 knockdown did 
not alter MEJ eNOS or Hb « expression (Supplementary Fig. 17d). 
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We conclude that EC expression of Hb « has a crucial role in the 
regulation of NOS-mediated signalling and in the control of arterial 
vascular reactivity. These results may have far-reaching implications 
that could influence many aspects of vascular biology and disease. For 
example, endothelial Hb « expression may participate in blood pres- 
sure control, arteriogenesis and anti-inflammatory signalling, as well 
as affecting other redox signalling molecules (for example, superoxide 
and hydrogen peroxide). Our results correlate with diagnostic indices 
for human o-thalassemia major (HBA1~‘~ HBA2~‘~) fetuses, who 
show increased cerebral blood flow during development”. Furthermore, 
these observations may help to explain why inhibition of CYB5R3 
attenuates hypertension® and may suggest that CYB5R3 is a new 
therapeutic target for disease treatment. However, studies devoted 
towards understanding the mechanisms of CYB5R3 regulation and 
its interaction with Hb o will need to be clarified. More broadly, somatic 
cell types as diverse as alveolar epithelial cells’, macrophages’, neurons’® 
and renal mesangial cells* express both Hb and NOS. It is thus possible 
that Hb could regulate NOS signalling pathways relevant to many cell 
and organ systems. Taken together, these data provide evidence for 
a new model in which somatic cell Hb oxidation is required for NOS- 
dependent bioactivity. 


METHODS SUMMARY 


Human coronary ECs and SMCs were co-cultured and fractionated as previously 
described”. iTRAQ proteomic screening was used to identify and quantify pro- 
teins enriched at the MEJ as previously demonstrated*. Proteins were analysed 
using western blot, immunofluorescence and immune-TEM. mRNA was measured 
using real-time-PCR. Isolated thoracodorsal arteries were cannulated, pressurized 
and stimulated with phenylephrine or ACh as previously shown’ or perfused 
with anaerobic aqueous NO. Detailed methods can be found in Supplementary 
Methods. 
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The single-component type-II NADH dehydrogenases (NDH-2s) serve 
as alternatives to the multisubunit respiratory complexI (type-I 
NADH dehydrogenase (NDH-1), also called NADH:ubiquinone oxi- 
doreductase; EC 1.6.5.3) in catalysing electron transfer from NADH to 
ubiquinone in the mitochondrial respiratory chain’. The yeast NDH-2 
(Ndil) oxidizes NADH on the matrix side and reduces ubiquinone to 
maintain mitochondrial NADH/NAD* homeostasis. Ndil is a poten- 
tial therapeutic agent for human diseases caused by complexI 
defects*’, particularly Parkinson’s disease, because its expression 
restores the mitochondrial activity in animals with complex! defi- 
ciency. NDH-2s in pathogenic microorganisms are viable targets for 
new antibiotics'®". Here we solve the crystal structures of Ndil in its 
substrate-free, NADH-, ubiquinone- and NADH-ubiquinone-bound 
states, to help understand the catalytic mechanism of NDH-2s. We find 
that Ndil homodimerization through its carboxy-terminal domain is 
critical for its catalytic activity and membrane targeting. The structures 
reveal two ubiquinone-binding sites (UQ; and UQ;;) in Ndil. NADH 
and UQ, can bind to Ndil simultaneously to form a substrate-protein 
complex. We propose that UQ; interacts with FAD to act as an inter- 
mediate for electron transfer, and that NADH transfers electrons 
through this FAD-UQ,; complex to UQy. Together our data reveal 
the regulatory and catalytic mechanisms of Ndil and may facilitate 
the development or targeting of NDH-2s for potential therapeutic 
applications. 

Electron transport chains, which comprise oxidative phosphoryla- 
tion complexes I to V in the inner membrane of mitochondria, serve as 
the core components during respiration’. Complex! catalyses the 
transfer of two electrons from NADH to reduce ubiquinone to ubiqui- 
nol, and is the entry point for a large fraction of the electrons that 
traverse the respiratory chain'*"*. This catalysis is the rate-limiting step 
in respiration and is central to energy metabolism’’. Three different 
types of NDH are known: proton-pumping NDH (NDH-1); non-pro- 
ton-pumping NDH (NDH-2); and the rare sodium-pumping NDH, 
called NQR*"°. 

First identified in domestic yeast’”"*, NDH-2 is a ~50-kDa, mono- 
topic membrane protein with no energy-transducing capacity. 
Subsequently, multiple NDH-2s were found in various fungi, plants 
and primitive eukaryotic cells’!. Genome-wide analysis has shown 
that a majority of organisms, such as Escherichia coli, contain both 
NDH-1 and NDH-2. However, some species have only NDH-2 
(Supplementary Fig. 1, Supplementary Data 1 and Supplementary 
Table 1). Ndil, one of the NDH-2s in Saccharomyces cerevisiae, faces 
the inside surface of the mitochondrial inner membrane, similar to 
NDH-1”*. Here we report the crystal structures of Ndil in its apo and 
substrate-bound states. 

Ndil forms a globular «/f structure that can be divided into three 
domains (Fig. 1a and Supplementary Fig. 2). Similar to other NADH- 
and FAD-associated enzymes, Ndil contains two canonical Rossmann 


domains with an FAD molecule buried deep in the first one (residues 
49-412, referred to as the active domain hereafter) (Fig. 1 and 
Supplementary Fig. 3). A search of the Dali database”’ revealed that 
the NADH- and FAD-associated enzymes are the closest structural 
homologues of the Ndil active domain, with most of them having a 
root mean squared deviation (r.m.s.d.) ofless than 2 A when compared 
with the Ndil active domain. The structures of Ndil and other 
NADH- and FAD-associated enzymes differ in their C-terminal 
domains (CTDs; Fig. 1b). Ndil CTD is highly conserved among 
NDH-2s from different species (Supplementary Fig. 2 and Sup- 
plementary Data 1 and 2) and contains three B-strands (§19-$21) 
and two a-helices («15 and 16) (Fig. la and Supplementary Fig. 2). 
The two a-helices form a helix—turn-helix motif that extensively inter- 
acts with the active domain, whereas the three B-strands make fewer 
contacts with the active domain (Supplementary Fig. 4). 

In all the structures, each asymmetric unit contains an Ndil homo- 
dimer (Fig. 2a). Interactions between the two Ndil monomers are 
mainly mediated through helix «16 of the CTD (Fig. 2b and 
Supplementary Fig. 5a, b), burying a surface area of 5,982 A. 
Consistent with this, we eluted wild-type Ndil from a gel filtration 
column with a native molecular mass of ~150 kD, corresponding to 
the calculated size of an Ndil dimer (Supplementary Fig. 6a). The 
linear relationship between Ndil concentration and its activity indi- 
cated that a single enzyme catalysed the electron transfer reaction 
(Supplementary Fig. 6b). To investigate whether Ndil was oligomeric 
in vivo, we co-expressed Ndil-FLAG and Ndil-Myc proteins in yeast 
cells and performed immunoprecipitation assays. As shown in Fig. 2c, 
the two Ndil proteins interacted with each other, indicating that Ndil 
formed an oligomer in yeast cells. Deletion of «16 compromised the 
interaction between Ndil-FLAG and Ndil-Myc, and deletion of both 
«15 and «16 abolished the interaction (Fig. 2c). This result indicated 
that the CTD is important for Ndil oligomerization. We further con- 
firmed the functional importance of the CTD through a rescue assay. 
The plasmid carrying wild-type Ndil but not the mutants with «16 
or «15 and «16 deleted, rescued the yeast ndilA phenotype 
(Supplementary Fig. 5c). Thus, Ndil functions as a homodimer both 
in vitro and in vivo. 

The membrane anchoring of E. coli NDH-2 is required for its activ- 
ities’*. One side of the Ndil homodimer contains a large hydrophobic 
surface, which is formed by the solvent-exposed residues from the 
CTD and surrounded by numerous positively charged patches 
(Fig. 2a, d). In solution, the hydrophobic surface was shielded by the 
detergent molecules used for protein purification, because several 
detergent molecules bound to this region in the structures of Ndil 
(Supplementary Fig. 7). This suggested that the hydrophobic surface 
could have a role in the mitochondrial membrane localization of Ndil. 
Indeed, fractionation indicated that wild-type Ndil protein was exclu- 
sively found in the membrane fraction. In contrast, a large pool of the 
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Figure 1 | Ndil has a unique CTD. a, Overall structure of Ndil. The CTD 
(residues 439-513) of Ndil is coloured in pink and the active domain in cyan. 
FAD is shown in stick representation. b, Two views of the structural alignment 


Figure 2 | The CID mediates Ndil homodimerization and membrane 
attachment. a, Ndil forms a homodimer in crystals. One protomer is coloured 
as in Fig. 1a and the other one is coloured in orange (CTD) and lime 
(remainder). The conserved, positively charged residues on the membrane 
attachment side of one protomer are shown as yellow sticks. b, Detailed 
interactions between «16 of the two Ndil protomers. c, Ndil forms a dimer in 
vivo. Top: western blots of lysates (input) of ndilA trp1A cells expressing the 
wild-type and the mutant forms of Ndil-Myc, Ndil-FLAG or both. Proteins 
were detected by western blot with anti-Myc and anti-FLAG antibodies. 
Bottom: Cell lysates in the top panels were subjected to anti-FLAG or anti-Myc 
immunoprecipitation (IP) followed by anti-Myc and anti-FLAG 
immunoblotting (WB). d, A large hydrophobic surface formed by the CTD in 


between Ndil and the Pdr-Pdx complex from Pseudomonas putida (Protein 
Data Bank ID, 3LB8). The CTD of Pdr is coloured in orange and the remaining 
part in green. Colour codes for Ndil are the same as those in a. 
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the Ndil homodimer. Left: electrostatic surface of the Ndil homodimer. The 
hydrophobic surface is highlighted with a dashed box. Right: the side chains of 
the surface residues from 15 and «16 of the CTD. For clarity, only one Ndil 
monomer is shown. e, The CTD is essential for Ndil membrane attachment. 
The lysates of ndilA cells transformed with vectors encoding the indicated 
forms were fractionated into three samples: total protein, membrane fraction 
and supernatant. The samples were subjected to western blots. The loading 
amounts of the samples were examined using anti-Porin for the membrane 
fractions and anti-PGK1 antibody for the total protein and supernatant 
fractions. f, Mutations compromising Ndil membrane attachment cause 
growth defects in yeast cells. 
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mutant Ndil protein with «15 and «16 deleted was detected in the 
supernatant fraction (Fig. 2e), supporting a role for the two a-helices in 
Ndil membrane anchoring. The two a-helices were required for yeast 
viability, because their deletion resulted in a lethal phenotype (Fig. 2f). 

We then solved the structure of the NADH-bound Ndil. As 
observed in other NADH and FAD reductases, NADH binds to the 
second Rossmann domain and is fully accessible to FAD from the 
matrix side (Supplementary Fig. 8). Stabilized by a number of hydro- 
gen-bonding interactions, NADH binding causes a slight conforma- 
tional change in this domain (Supplementary Fig. 9a, b). Supporting 
their functional importance, single mutations of the NADH-interact- 
ing residues caused a lethal phenotype in yeast (Supplementary Fig. 
9c). The nicotinamide ring of NADH is orientated in parallel with the 
re-face of the FAD isoalloxazine ring, with the distance between C4 of 
NADH and N5 of FAD being 3A, indicating that a direct electron 
transfer between them can occur. 

The structure of the ubiquinone-bound form was determined by co- 
crystallizing Ndil with ubiquinone in the absence of NADH. One 
ubiquinone molecule (UQ;) with a well-defined density is located close 
to FAD (Fig. 3a) and lies in the groove formed by B19-B21 and «15 
from the CTD (Supplementary Figs 10a, b). The distance between the 
centre of the FAD isoalloxazine ring and UQ; is about 7.8 A. This site 
was occupied by a Triton X-100 (TRT) molecule in the apo structure 
(Supplementary Fig. 7), explaining why the protein extracted with 
n-dodecyl-B-p-maltopyranoside (DDM) but not with TRT contained 
bound ubiquinone**™. Mutations of the residues that interact 
with UQ; markedly attenuated the enzymatic activity of Ndil and 
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Figure 3 | Electron transfer of Ndil involves two ubiquinone molecules. 

a, Two ubiquinone molecules bind to Ndil. The ubiquinone-binding sites in 
the Ndil-ubiquinone complex are shown in ribbon representation. The 
ubiquinone and FAD molecules are shown as sticks and labelled. The F, — F- 
omit density (green) is contoured at 3.56. b, Ribbon representation of the 
reaction centre in the Ndil-NADH-ubiquinone complex. The F, — F, omit 
density (green) is contoured at 3.50. c, Generation of Ndil-catalysed radicals 
requires both NADH and ubiquinone. The EPR spectra of the g = 2.003 signals 
(g, electron g-factor) arise from ubisemiquinones of Ndil proteins as indicated. 


480 | NATURE | VOL 491 | 15 NOVEMBER 2012 


compromised yeast survival (Supplementary Figs 10c, d). Although 
comparatively less well defined, a second ubiquinone molecule (UQ;;) 
was assigned to the stretch of density adjacent to UQ; (Fig. 3a). This 
assignment was consistent with the recent studies indicating that Ndil 
contains both a bound ubiquinone and a catalytic ubiquinone at sim- 
ilar sites****. Both ubiquinone molecules are located on the si-face of 
the isoalloxazine ring of FAD and extend their aliphatic side chains 
towards the large hydrophobic surface (Supplementary Fig. 10a). The 
less well-defined UQy forms fewer contacts with Ndil than does UQ; 
(Supplementary Fig. 10b). 

Putidaredoxin reductase (Pdr) is an NADH-dependent enzyme and 
transfers electrons to P450 through forming a complex with the [2Fe- 
2S]-containing protein putidaredoxin® (Pdx). As anticipated, the 
structure of Pdr resembles that of the Ndil active domain. UQy, in 
Ndil is similarly positioned relative to the [2Fe-2S] cluster in Pdx. The 
distance between the centre of the isoalloxazine ring and the UQy 
centre is about 11.2A, comparable to the 12A distance between 
FAD and the [2Fe-2S] cluster in the Pdr-Pdx complex (Sup- 
plementary Fig. 11). These structural observations suggest that it is 
spatially possible for the electron to be transferred from reduced FAD 
(FADH,) to UQy. 

Previous studies suggested that Ndil could form a ternary complex 
with NADH and ubiquinone”. We therefore soaked the ubiquinone- 
bound Ndil crystals with NADH and solved the structure of the Ndil- 
NADH-ubiquinone complex (Fig. 3b). UQn was not observed in the 
complex. Because the crystals of the complex were obtained in the 
presence of oxygen, it might represent an enzyme-substrate complex 
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AHpp, peak-to-peak width. d, Ndil-catalysed generation of two populations of 
radicals. The power saturation behaviours of the g = 2.003 EPR signals of Ndil 
were monitored at 40 K. All experimental parameters except the microwave 
power were the same as in c. The ubisemiquinone signal of Ndil(DDM)- 
NADH (red) was fitted by the equation described in Methods, and that of the 
sample with excessive ubiquinone (black) was resolved as the sum of two 
components. The theoretical curves for these two components are shown by the 
purple and blue dotted lines. Two sets of half-saturation parameters (P)/2) and 
the relative concentration ratio are indicated. 
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rather than a metastable enzyme-intermediate complex. Comparison 
with the apo-Ndil structure shows that Ndil undergoes almost 
no conformational changes when bound by ubiquinones (r.m.s.d, 
0.22 A) or NADH-ubiquinone (r.m.s.d, 0.26 A). The UQ;-binding site 
is almost identical in the two structures, suggesting that UQ; may act 
as a cofactor together with FAD to mediate electron transfer 
(Supplementary Fig. 12). 

Our structures reveal that Ndil possesses a unique CTD (Fig. 1 and 
Supplementary Fig. 2) that mediates the Ndil dimerization required 
for its activity (Figs 2a-c and Supplementary Fig. 5). The CTD- 
mediated Ndil dimerization creates a large hydrophobic surface 
at one side (Fig. 2d) and a hydrophilic groove at the other 
(Supplementary Fig. 13). The biochemical assay (Figs 2e, f) supports 
there being a role for the hydrophobic surface in targeting Ndil to the 
mitochondrial membrane. The positively charged patches around the 
hydrophobic surface could also contribute to the membrane local- 
ization of Ndil, presumably through interactions with the phosphate 
groups of membrane-localized phospholipids. Further investigations 
are needed to understand the specific mitochondrial membrane tar- 
geting of Ndil. Additionally, the CTD is directly involved in the recog- 
nition of ubiquinones by Ndil (Figs 3a, b). Taken together, we 
conclude that the unique CTD of the single-component enzyme 
Ndil enables it to perform the electron transfer function of the 
multi-component NDH-1s. 

Like many ubiquinone-reactive enzymes*”~*°, Ndil has two ubiqui- 
none-binding sites (Fig. 3a). UQ; may not readily exchange with the 
ubiquinone pool, because it forms extensive interactions with the CTD 
of Ndil. In contrast, UQy may better serve this purpose because it has 
much fewer contacts with Ndil (Supplementary Figs 10a, b). Our 
structures suggest that there is an electron transfer pathway from 
NADH to UQy, with UQ; acting as an intermediate together with 
FAD. FAD first accepts two electrons from NADH to form FADH). 
Several possible mechanisms could be responsible for subsequent elec- 
tron transfer. FADH, could transfer one electron to UQ; and one to 
UQ;, reducing both to ubisemiquinones. Then a disproportionation 
reaction between them could occur, restoring UQ; to its previously 
oxidized state and converting UQy into ubiquinol. Alternatively, 
FADH, could transfer the electrons one by one to UQ;; through the 
bound UQ, reducing UQy to ubiquinol. It is also possible that FADH, 
transfers both electrons successively to UQ; and that the two electrons 
are then further transferred to UQy consecutively. The net results of 
these three different pathways are the same: two electrons from NADH 
are transferred through the FAD-UQ; complex to UQy. All three 
mechanisms involve ubisemiquinone radicals. Indeed, we detected 
the NADH-ubiquinone-dependent ubisemiquinone radicals in the 
Ndil-catalysed electron transfer reaction using electron paramagnetic 
resonance (EPR) (Fig. 3c). Moreover, the power saturation experi- 
ments suggested the existence of two different populations of ubisemi- 
quinone radicals (Fig. 3d). Future studies, such as EPR experiments at 
different pHs and redox titrations, are needed to establish the detailed 
electron transfer pathway of Ndil. 

Our study demonstrates that the electron transfer activity of the 
single-component Ndil is enabled by its unique CTD. Our data also 
support a model in which Ndil catalyses the electron transfer from 
NADH to UQy through a mechanism mediated by the FAD-UQ; 
intermediate. This information could be valuable in guiding the engin- 
eering of Ndil for therapy and the design of antiparasitic inhibitors of 
NDH-2s. 


METHODS SUMMARY 


The wild-type and mutant forms of Ndil were expressed and purified from E. coli 
strain C43. Crystals were grown at 18°C by the hanging-drop vapour diffusion 
method. Diffraction data sets were collected at beamline BL17U of the Shanghai 
Synchrotron Radiation Facility. The structures were solved by single-wavelength 
anomalous diffraction and molecular replacement methods. Data collection and 
structure refinement statistics are summarized in Supplementary Table 2. The 
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EPR data were collected at the High Magnetic Field Laboratory of Chinese 
Academy of Sciences. 


Full Methods and any associated references are available in the online version of 
the paper. 
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METHODS 


Genome-wide analysis of phylogenetic relationships. NDH sequences from 
representative species of Prokaryota, Protista, Plantae, Fungi and Metazoa were 
used to investigate the evolutionary history of this gene family. Because some of the 
selected species were available only with draft genomes, two methods were used to 
obtain the sequences. The genes from completely sequenced genomes were 
obtained from KEGG (the Kyoto Encyclopedia of Genes and Genomes) 
ORTHOLOGY Database (KEGG ORTHOLOGY Entry K03885). (The locus tags 
of NDHs are listed in Supplementary Table 1.) The genes from draft genomes 
(Saccharomyces mikatae and Saccharomyces bayanus; see Supplementary Data 2) 
were identified by TBLASTN searches using three full-length NDH protein 
sequences of S. cerevisiae. The collected sequences were aligned using the 
MUSCLE (multiple sequence comparison by log-expectation) program*!. The 
alignment was further adjusted manually using BIOEDIT”. Phylogenetic relation- 
ships among the NDHs were reconstructed using a maximum-likelihood proced- 
ure in PHYML* with the Jones-Taylor-Thornton amino-acid substitution model 
and 100 bootstrap replicates. (The tree and alignment are available as Sup- 
plementary Fig. 1 and Supplementary Data 1 and 2.) Furthermore, we obtained 
the subunit numbers of NADH:ubiquinone oxidoreductase (complex I) of each 
species from the KEGG PATHWAY Database (pathway entry, ko00190). 
Protein expression and purification. The complementary DNA of Ndil from 
S. cerevisiae was cloned into pQE-80L vector. The Ndil mutants were generated by 
two-step PCR and were subcloned, overexpressed and purified in the same way as 
wild-type protein. Selenomethionine (Se-Met)-labelled protein was purified simi- 
larly. Overexpression of Ndil was induced in E. coli strain C43 using 0.5mM 
isopropyl-B-p-thiogalactopyranoside when the cell density reached Déoonm = 
1.0. After growth for 10-12h at 35°C and 80r.p.m., the cells were collected, 
homogenized in a buffer containing 50 mM Tris (pH 8.0) and 1 mM EDTA, and 
disrupted by sonication. Cell debris was removed by low-speed centrifugation for 
15min. The supernatant was collected and applied to ultracentrifugation at 
150,000g for 1h. The membrane fraction was collected and suspended in 
50mM Tris (pH7.6), 200mM NaCl, 0.1'mM EDTA and 10% glycerol 
(buffer A). After the addition of Triton X-100 at a final concentration of 0.3% 
(w/v) and 1 mM PMSF, the sample was incubated for 2 h with slow stirring at 4 °C. 
Following another ultracentrifugation step at 150,000g for 30 min, the supernatant 
was collected and loaded onto Ni**-nitrilotriacetate affinity resin (Ni-NTA, 
Qiagen) pre-equilibrated with buffer A containing 0.1% Triton X-100. Sub- 
sequently, the resin was washed three times, each time with 5 ml of buffer A 
containing 0.1% Triton X-100 and 8 mM histidine. Then the protein was eluted 
with buffer A containing 0.02% Triton X-100 and 200 mM histidine. The purified 
Ndil protein was concentrated and subjected to gel filtration chromatography 
(Superdex-200 10/30, GE Healthcare) in a buffer containing 25 mM Tris (pH 7.6), 
300 mM NaCl, 5mM DTT, 0.1mM EDTA and 0.02% Triton X-100. The peak 
fractions were collected for biochemical analyses or crystallization. 
Crystallization, data collection and structure determination. Crystals were 
grown at 18°C by the hanging-drop method by mixing 1 ul of protein (10 mg 
ml‘) with 1 pl of reservoir solution (3.8 M potassium formate, 2% w/v polyethyl- 
ene glycol monomethyl ether 2,000 (pH 4.2 or 10.2)). The crystals appeared 
overnight and grew to full size in about four to five days. To obtain the NADH- 
bound Ndil crystals, we soaked the crystals in cryo-protectant with 10 mM NADH 
for about 1-3 min. To obtain the ubiquinone-bound Ndil crystals, we made fresh 
50 mM ubiquinone-4 (UQ,) stock solution in DMSO. Ndil and ubiquinone were 
co-crystallized with a final concentration of 0.5mM UQ,. To get the NADH- 
ubiquinone-bound Ndil crystals, we soaked the ubiquinone-bound Ndil crystals 
in cryo-protectant with 10 mM NADH for about 1-3 min. The crystals were then 
transferred to liquid nitrogen. 

All the data were collected at Shanghai Synchrotron Radiation Facility beamline 
BL17U and were integrated and scaled using the HKL2000 package**. Further 
processing was carried out using programs from the CCP4 suite”. Data collection 
statistics are summarized in Supplementary Table 2. The self-rotation function 
with POLARREN revealed a prominent peak at the section of k = 180°, indicating 
the presence of a non-crystallographic two-fold symmetry axis. SHELXD*° was 
used to locate the positions of selenium sites, and HA_NCS*’ independently 
identified a two-fold non-crystallographic symmetry axis from the anomalous 
sites. The non-crystallographic symmetry from selenium sites has the same ori- 
entation as that from the self-rotation function, which verified the validity of the 
anomalous scatter positions. The identified anomalous scattering sites were input 
to PHASER® for single-wavelength anomalous diffraction phasing. The real-space 
constraints were applied to the electron density map in DM”. The final electron 
density map was of sufficient quality for BUCCANEER” to be able to build almost 
the complete model. The final model rebuilding was performed using COOT"! and 
the protein structure was refined with PHENIX” against the high-resolution native 
data using non-crystallographic symmetry and stereochemistry information as 
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restraints. Other structures were solved using molecular replacement. Data 
collection and structure refinement statistics are summarized in Supplementary 
Table 2. 

Yeast strains and plasmids. The strains used in this study were derived from 
haploid strain BY4742 (MATz« his3A1 leu2A0 lys2A0 ura3A0). ndi1A was gener- 
ated by disruption of the NDI1 gene through homologous recombination with 
KanMX4 as the replacement marker. Screen of positive colonies was achieved by 
spreading cells on YPD (yeast extract/peptone/dextrose) medium plus 0.4mg 
ml’ G418. Growth defects on YPG (yeast extract/peptone/glycerol) medium 
were used to confirm the disruption. ndilA trp14 was generated by replacing 
the TRP1 gene of ndilA with LEU2. After selection on the SD-Leu medium, the 
colonies were further streaked on SD-Trp plates to confirm the deletion. 

The vector pADH1-YES2 was modified from the pYES2 vector from 
Invitrogen, in which the GAL1 promoter was replaced with an ADH1 promoter 
for the purpose of constitutive expression of the target gene. pYES2-Ndil and 
pADH1-YES2-Ndil were generated by insertion of the complementary DNA of 
Ndil between the restriction sites EcoRI and Xhol of the vectors pYES2 and 
pADHI1-YES2, respectively. The Ndil mutants were generated by two-step 
PCR and were subcloned into the two vectors using the same approach. To obtain 
pYES2-Myc-Ndil and pYES3-FLAG-Ndil constructs, DNA sequences corres- 
ponding to Myc (EQKLISEEDL) and FLAG (DYKDDDDK) tags were inserted 
into the complementary DNA sequence of Ndil of pYES2-Ndil and pYES3-Ndil 
constructs, respectively, just after its mitochondrial targeting sequence. 

Yeast growth assay. For growth on agar plates, yeasts expressing wild-type and 
mutated NDI] genes in pYES2 or pADH1-YES2 vector were inoculated into liquid 
SD-Ura medium for overnight culture. After normalization of D¢¢0 nm to 0.4, the 
cells were spotted in tenfold serial dilutions on SG-Ura and SD-Ura plates. The 
plates were incubated at 30 °C for three days before the pictures were taken. SD- 
Ura medium contains 2% (w/v) glucose, 0.5% (w/v) ammonium sulphate, 0.17% 
yeast nitrogen base with essential amino acids and other required nutrients except 
uracil. SG-Ura medium contains all the same components as SD-Ura except that 
the 2% (w/v) glucose is replaced with 2% (v/v) glycerol. For growth on agar plates, 
2% agar was added. 

Enzyme assays for Ndil. The enzymatic activity of wild-type and mutant Ndil 
proteins was measured spectrophotometrically using NADH and UQ; as sub- 
strates. The standard assay was carried out at 30 °C in a 1.8-ml reaction mixture 
containing 50 mM MOPS (pH 7.0), 1 mM EDTA and varying amounts of NADH, 
UQ, and enzymes. Reactions were initiated by the addition of the enzyme to the 
reaction mixture. The progress of the reaction was monitored continuously by 
following the formation of NAD* at 340nm, using a Lambda 45 spectropho- 
tometer (PerkinElmer Life Sciences) equipped with a magnetic stirrer in the 
cuvette holder. The activities were determined using an extinction coefficient 
for NADH of 6,220cm~! M7! at 340nm. 

Co-immunoprecipitation experiments. Co-immunoprecipitation was per- 
formed as follows. For wild-type Ndil, ndilA trp1A cells were transformed with 
vectors pYES2-Myc-Ndil and pYES3-FLAG-Ndil. After selection, the cells were 
grown in 50 ml of SGE-Ura medium to exponential phase before the cells were 
lysed by French press with extraction buffer (PBS, 0.1% Triton X-100 and protease 
inhibitor cocktail (Sigma)). The lysates were centrifuged at 15,000g for 30 min at 
4°C, and the supernatants were then incubated overnight with anti-FLAG or 
anti-Myc antibodies (Sigma) conjugated to protein A/G agarose. The agarose 
beads were washed three times with extraction buffer, boiled in SDS-polyacryla- 
mide gel electrophoresis (PAGE) loading buffer, subjected to the SDS-PAGE and 
analysed by western blotting. Co-immunoprecipitation experiments for the trun- 
cated forms of Ndil were performed similarly. 

Membrane fractionation. For membrane fractionation assay, ndilA cells trans- 
formed with pYES2-Ndil vectors were cultured overnight in SD-Ura medium and 
then transferred to SGE-Ura. After growing to the exponential phase, cells were 
collected by centrifugation. The cells were broken in lysis buffer (PBS buffer with 
protease inhibitors), and after centrifugation at 4,300g for 15 min the supernatants 
were obtained as the sample of total proteins. The solution of total proteins was 
then further ultracentrifuged at 250,000g for 1 h, and the resulting supernatant and 
pellet taken as samples of supernatant and membrane fractions, respectively. The 
total protein, supernatant and membrane fractions were subjected to western 
blotting. 

EPR characterization of ubisemiquinone radical. Four different samples pre- 
pared from Ndil proteins extracted using Triton X-100 (Ndil-TRT) or DDM 
(Ndil-DDM) were tested: NADH-UQ,-Ndil-DDM, NADH-Ndil-DDM, 
NADH-Ndil-TRT and UQ;-Ndil-DDM. All EPR samples were prepared in 
anaerobic conditions after purification of the Ndil protein. The Ndil protein 
was added to a final concentration of 501M and incubated with or without 
5mM UQ,. After the addition of 2mM NADH and incubation for 1 min, the 
sample was transferred to an EPR tube before a quick freezing to liquid-helium 
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temperature (around 2K). EPR measurements were conducted on a Bruker 
X-band (9.4GHz) EMSplus10/12 spectrometer. A cylindrical resonator (ER 
4119hs TEO11) was used for data collection. During EPR data acquisition, the 
sample temperature was maintained by an Oxford Instrument ESR 910 liquid- 
helium continuous-flow cryostat. A high-sensitivity EPR spectrum was obtained 
by temperature optimization from 2 to 200K, and most of the EPR spectra were 
collected at an optimized temperature, 40 K. The EPR data acquisition parameters 
were as follows: modulation amplitude, 5.0 G; microwave power, 50 LW; sweep 
time, 40 s; time constant, 81.92 ms; sample temperature, 40 K. 

A power saturation experiment was implemented with the microwave power ran- 
ging from 44 to 14 dB (or 8 .W to 8 mW) with a step size of 2 dB. The power satura- 
tion data was fitted following the equation D= >. ; CV PO + P/Pyj;) °° ,in 
which C; is the relative concentration of the ith free radical, P,,,; is the half- 
saturation parameter and b; is the homogeneity parameter. All the EPR data were 
collected at the High Magnetic Field Laboratory of Chinese Academy of Sciences. 
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Waiting 
A US ‘green-card’ visa can open up career possibilities. 
But getting one requires stamina — anda dash of luck. 


BY KAREN KAPLAN 


enomics researcher Mark Eppinger was 
Ga to crave stability. Since arriv- 

ing in the United States in 2004 from 
his native Germany, Eppinger had completed 
a postdoc and a stint in industry, and had 
been working for five years at the University 
of Maryland in Baltimore in a non-tenure- 
track post. Newly married and with a baby, 
he wanted to find a permanent job, ideally a 
tenure-track academic position, and remain 
in the country. 

But Eppinger was in his fifth year of an H-1B 
non-immigrant work visa, which expired in a 
year and could not be renewed — it has a maxi- 
mum term of six years. Because he hoped to 
remain in the country indefinitely, Eppinger 
decided to apply for an immigrant visa that 
grants permanent-resident status in the United 
States to non-US citizens — the sought-after 
‘green card} so called because it used to be 


for green 


issued as an actual green card. After consulting 
colleagues who had successfully navigated the 
process, Eppinger applied for a self-sponsored 
green card in October 2011 to US Citizenship 
and Immigration Services (USCIS), the federal 
agency that adjudicates applications and issues 
immigrant visas. With the help of an immi- 
gration attorney, he filed his application under 
two of five categories available to those who are 
applying for an employment-based immigra- 
tion visa (see ‘Employer- or self-sponsorship’). 

Although there were no glitches and he 
received his green card with time to spare, 
Eppinger found the process stressful and 
time-consuming. The two categories under 
which he had applied — as an individual of 
‘extraordinary ability’ (“Priority Worker’), and 
as an individual seeking a National Interest 
Waiver — each required substantial docu- 
mentation of his professional achievements. In 
addition, he had to round up recommendation 
letters from half a dozen eminent researchers 


in his field. “That was the most difficult part,” 
Eppinger recalls. He pre-wrote each letter, 
focusing on different accomplishments, then 
waited to get them back. “It took so long,” 
Eppinger says. “Even though the letter’s already 
written, you have to let people freely edit it.” 
Despite the anxiety, however, he is relieved to 
have the card. “You are way more flexible,” says 
Eppinger, now an assistant professor in micro- 
bial genomics at the University of Texas at San 
Antonio, a tenure-track position. “My wife 
and I knew we wanted to stay here — we have 
no intentions of going back to Germany — so 
going for the green card was a no-brainer.” 

There are a daunting number of steps and a 
lot of documentation required to get a green 
card. But applicants can smooth their path by 
giving themselves ample time to understand 
the process, gather the necessary evidence, file 
paperwork and wait for application backlogs 
to clear. 

The green card is highly sought after because 
researchers in the United States who hold any of 
the temporary, non-immigrant visas are limited 
in various ways. Waiting times for getting many 
of them can be six months or more (see Nature 
460, 131-132; 2009). There is the ‘home- 
residence’ rule that requires some visa holders 
to return to their home country for two years 
after the visa expires. Some visas limit travel in 
and out of the United States; others allow the 
visa holder to work only for the employer that 
sponsored the visa. And temporary visas have 
set terms that vary from one to six years. 


DECK OF CARDS 

As a permanent resident with a green card, 
Eppinger is now free from such worries. Green 
cards need to be renewed only once every 
10 years and there are no limits on the num- 
ber of renewals. And green-card holders can 
travel out of the United States for up to 179 
days each year — a boon for researchers plan- 
ning to attend foreign conferences or take part 
in international collaborations. A huge bonus 
for many is that having a green card confers the 
right to apply for all federal grants and fellow- 
ships, whereas those on non-immigrant visas 
are not eligible for some types of funding. 

“If you're on a self-sponsored green card,” 
says immigration attorney Elizabeth Goss of 
Tocci, Goss & Lee in Boston, Massachusetts, 
“you can work for whomever you want or start 
your own company. You're not dependent on 
any employer to sponsor you for a visa. You 
don't have to be beholden to anyone” 

Green-card applicants can pursue an > 
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GREEN-CARD CATEGORIES 


Employer- or self-sponsorship? 


There are five ‘preference’ categories for 
employment-based applications (see 
go.nature.com/v9toa4). 

@ EB-1 Priority workers, those with 
‘extraordinary abilities’. A select few 
researchers, mainly those well established 
in their careers, self-sponsor under this 
category. No labour certification required. 
@ EB-2 Professionals with advanced 
degrees. There are three subgroups: 
Exceptional ability. Applicant must be able 
to show exceptional ability in the sciences 
and meet three out of seven criteria. But 
the applicant must first obtain certification 
from the US Department of Labor, verifying 
that there are insufficient qualified and 
willing US workers to fill the position at the 
prevailing wage. Employer-sponsored. 
Advanced degree. Applicant’s position must 


> employment-based visa, which requires a 
certain level of skills, education and/or work 
experience, or a family-based visa, which 
requires sponsorship by a spouse or relative 
who is a US citizen or legal permanent resi- 
dent. Once a researcher decides to apply for a 
green card, the next determination is the type 
of sponsorship to pursue. For employment- 
based applications, the options are employer 
sponsorship or self-sponsorship. Employer 
sponsorship requires a job offer or an exist- 
ing position; in both cases, the applicant must 
expect to remain in that job for up to two years. 
If they leave that job within six months of get- 
ting the card, it may be revoked. 

There are other restrictions to employer 
sponsorship: employers may choose to spon- 
sor only particular positions, or only employees 
who will be ina particular position for a certain 
length of time. And employers may require an 
employee to work for some time before he or 
she becomes eligible for sponsorship. However, 
the employer generally organizes the process 
and must pay the processing fees and other 
expenses of the application, such as the cost of 
the required medical examination. 

Many scientists, however, choose self-spon- 
sorship, which does not depend on having a job 
or job offer. Immigration experts note other 
benefits: a self-sponsoring applicant can soon 
move to another job — as long as it’s in the same 
field — or launch his or her own company. 

The application steps for employer and 
self-sponsorship are very similar (see ‘Secur- 
ing the visa’). But anyone applying under the 
Priority Worker category (employment-based 
category 1, or EB-1) must prove that they have 
“extraordinary” abilities and are internation- 
ally renowned in their field. A doctorate or 
postdoc experience at an elite US university 


require an advanced degree and applicant 
must be able to prove that they have at 
least five years of post-degree experience in 
their speciality. Applicants require a labour 
certification. Employer-sponsored. 

National Interest Waiver. Applicants request 
that the labour certification be waived 
because their working in the United States 
would be in the national interest. Most 
self-sponsoring researchers apply in this 
subgroup. 

@ EB-3 Professionals, skilled workers and 
unskilled workers. 

@ EB-4 Certain immigrants, including those 
who have served or are serving abroad for 
the US federal government. 

@ EB-5 Immigrants who are investing at 
least US$500,000 in a new commercial 
business that will create US jobs. K.K. 


will not be sufficient. “You've got to show that 
youre one of the small percentage at the very 
top of your field,” says immigration attorney 
Brendan Delaney of Leavy, Frank & Delaney 
in Bethesda, Maryland, who, as a consultant 
for the US National Postdoctoral Association 
in Washington DC, works with many early- 
career researchers. 

Asa result, most early-career researchers 
opt for the National Interest Waiver subcat- 
egory of professionals with advanced degrees 


(EB-2). The other stages are as for an employer- 
sponsored application, but the applicant has the 
responsibility of providing all the information 
needed and of monitoring their application. 
After submitting an I-140, they receive their 
‘priority date’ — their place in line for an immi- 
grant visa number. Once the priority date is cur- 
rent and the I-140 is granted, the applicant must 
then file an I-485 — the final stage. 

Evidence submitted at the I-140 stage of a 
self-sponsored application could include copies 
of published papers along with information on 
first or second authorship, the papers’ citation 
indexes and the impact factor of the journals in 
which they appeared; records of interviews and 
conference talks, especially invited talks; peer 
review undertaken; patents filed, especially ifin 
use or generating interest; and awards, grants 
and fellowships (see ‘Gathering the evidence’). 


BEWARE THE CAVEATS 
All these documents — along with a published 
book and grants in hand — were what it took 
for Dimitar Baronoy, a Bulgarian native with a 
PhD from Boston University in Massachusetts, 
to secure a self-sponsored green card. The card 
was key to Baronov’s plans, and in 2010 he co- 
founded Sterling Point Research in Boston, 
Massachusetts, which develops medical diag- 
nostics. Baronov says that he has far more flex- 
ibility. “Tt would have been really hard to run this 
company if I were still a temporary-visa holder,’ 
he says. “I can apply for federal grants, I have 
latitude for my research and I can create jobs.” 
But applications don’t come cheap. Self- 
sponsoring applicants who hire an immigra- 
tion attorney — and immigration experts 


STEP-BY-STEP 


Securing the visa 


Typically, there are four or five stages to a 
green-card application, whether employer- 
sponsored or self-sponsored. 

@ The employer or self-sponsor must 
determine which of five employment-based 
(EB) immigrant visa categories under 
which to petition (see ‘Employer- or self- 
sponsorship’). 

@ For employer-sponsored green cards, 
the employer must get labour certification 
from the US Department of Labor in 

some cases. It is not necessary under any 
subgroups of EB1 (‘Priority Worker’), or the 
National Interest Waiver subgroup of EB2. 
Certification can take up to eight months. 
@ Next, the employer or self-sponsor must 
file an Immigrant Petition for Alien Worker, 
known as an I-140, to the US Citizenship 
and Immigration Services (USCIS), which 
adjudicates the application and, on 
approval, sends the employer a notice with 
the applicant’s ‘priority date’ — their place 
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in line for an immigrant visa number. The 
agency will also send the approved petition 
to the US Department of State’s National 
Visa Center, where it will remain until an 
immigrant visa number becomes available. 
@ The sponsor must then check the state 
department’s monthly Visa Bulletin 

to determine whether the priority 

date is backlogged. If it is — because 

the applicant’s country of origin is 
oversubscribed — he or she may have to 
wait for months or years. Each country 

has an annual quota, so applicants from 
oversubscribed countries, such as Mexico, 
China, India and the Philippines, can face 
years of delay. 

@ Once the priority date is current, the 
sponsor files an Application to Adjust Status 
(also known as an Application to Register 
Permanent Residence), called the I-485, to 
the USCIS — and the applicant gets their 
green card. KK. 
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recommend doing that — will rack up 
legal and federal processing fees and other 
expenses of up to US$8,000. 

There are other caveats. One is that green- 
card renewal, although usually routine, 
requires the holder to be free of certain 
criminal convictions. Some missteps, such 
as overstaying a previous visa, may render 
the applicant ineligible. Another sticking 
point is that trips out of the country beyond 
179 days require USCIS re-entry permits, 
which provide for absences of up to two 
years. Scientists who travel outside of the 
country on collaborations or fieldwork for 
more than six months at a time should be 
aware that the USCIS might interpret those 
trips as abandonment of their US residence 
and move to deport them. 

Opinions vary on the best time for an early- 
career researcher to pursue a green card. 
Some immigration experts say that it can take 
as long as eight years after a researcher's US 
career launch, depending on the quality of 
their publication and citation records. 

With so much riding on the card, a rush 
to apply can create anxiety. Hou-Sung Jung, a 
research assistant professor of plant biology at 
Dartmouth College in Hanover, New Hamp- 
shire, experienced this first-hand. The South 
Korea native spent seven years as a post- 
doc, mostly on an H-1B visa. Busy with his 
research, publications and a new marriage, he 
forgot about the visa’s expiry date. He was able 
to assemble the documentation and materi- 
als for a green-card application within two 
months and received the card with a month 
to spare. “I was lucky,’ says Jung. 

Some immigra- 
tion experts counsel 
that a researcher 
arriving in the 
United States on 
a temporary visa 
of any type should 
immediately start 
educating them- 
selves on the pro- 
cess and details of 


acquiring the green 

y 2 card, even if they 
Getting don’t expect to want 
the right one. “If you leave 
information Was things too late, you 
a big struggle may find yourself 
in the very in a position where 


beginning.” 
Sadakatsu Ikeda 


you're running out 
of options,” says 
Delaney, who, as a 
native of Northern Ireland, has gone through 
the process himself. 

Applicants may receive conflicting or inac- 
curate information from employers that steers 
them down the wrong path. “At the green-card 
centre at one university, the information on 
the process was so confusing,” says Sadakatsu 
Ikeda, a native of Japan who is a clinical fellow 


TOP TIPS 
Gathering the evidence 


@ Start amassing necessary 
documentation as early as possible: 
birth certificate, academic degrees, 
proof of scientific-society and 
association memberships, invited 
conference presentations, book 
chapters, copies of or links to media 
interviews and their dates, awards and 
dates, published manuscripts, citation 
indexes, journal impact factors. 

@ Network at every chance you get: you 
need to create an association of high- 
level colleagues outside your institution 
with whom you have not collaborated, 
including, for example, contacts from 
conferences or talks. They will be 

your sources for up to five letters of 
recommendation, which will be more 
influential than those from institutional 
or collaborative colleagues. 

@ Write the letters yourself, focusing 
on different accomplishments for 

each that illustrate how you fit into the 
category under which you’re applying. 
Send to the appropriate recipient with 
a stamped, self-addressed envelope 
and provide a deadline. K.K. 


in haematology and oncology at the Univer- 
sity of Michigan in Ann Arbor. “Different 
people told me different things.” Ultimately, 
he decided to pursue the self-sponsored route 
for the freedom to move to different employ- 
ers and to control the process himself. 

Shortly after starting a molecular biology 
and genetics postdoc at Harvard Medical 
School in Boston, Massachusetts, Ikeda hired 
an attorney who showed him how to apply 
under the National Interest Waiver subgroup 
and recommended that Ikeda work on pub- 
lishing citable papers, building an extensive 
network of contacts and getting his name 
widely known in his field. By the time Ikeda 
began to pursue the application seven years 
later, after a three-year residency, he was first 
or second author on a number of papers in 
high-impact journals, had amassed a robust 
contacts network and was well known in can- 
cer research. He had no trouble getting five 
letters of recommendation, and received his 
green card without any hassle six months later. 

“Getting the right information was a big 
struggle in the very beginning, and building 
this network and my reputation — that was 
the biggest obstacle,” says Ikeda. “But in the 
end, I was successful because I had a good 
roadmap.’ # 


Karen Kaplan is assistant Careers editor at 
Nature. 
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PARTNERSHIPS 
UK recruitment drive 


Recruitment of academic researchers is 
under way at Queen's University Belfast 
(QUB), the University of Manchester 
and University College London (UCL) 
in connection with around £1 billion 
(US$1.6 billion) in awards from the UK 
government's Research Partnership 
Investment Fund and associated 
philanthropic donations. QUB will use 
its £32-million share to help hire 500 
researchers and clinicians for its health- 
sciences institute, which includes a 
new research centre for eye disease and 
diabetes slated for completion in 2016. 
Manchester will hire about 20 senior 
academics in 2013 for cancer research, 
and UCL expects to create more posts at 
a new centre for rare paediatric diseases 
scheduled to open in 2018. 


EDUCATION 
Teachers without PhDs 


British postgraduate students might often 
be better qualified than their lecturers, 
according to a study, Academic Staffin UK 
Higher Education Institutions, released 

on 1 November. The report finds that just 
over half of the full-time academic staff 
across all disciplines and institutions have 
doctoral degrees. Author Malcolm Tight 
at Lancaster University, UK, warns that 
the results reflect poorly on the quality 

of postgraduate education at some UK 
universities. He argues that universities 
have focused on diversifying their student 
base at the expense of developing the 
quality of their faculty. “It’s certainly 
possible that a doctoral student might 
have more knowledge of a discipline than 
her or his professor,’ Tight says. 


POSTGRADUATES 
International enrolment 


First-time enrolment of international 
graduate students in the United States 
rose 8% for 2011-12 across all academic 
fields, matching the previous year’s 
increase, according to the latest survey 
by the US Council of Graduate Schools 
in Washington DC. Gains in the sciences 
alone were more modest, with life 
sciences up 1% and physical sciences 4%. 
Business and engineering led all fields 
with gains of 15% and 12% respectively. 
China continues to send increasing 
numbers of graduate students to the 

US; enrolments were up 22% from the 
previous year, the seventh consecutive 
year of growth. 
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LET SLIP THE DOGS 


BY WILLIAM T. VANDEMARK 


market, a woman dressed in black walks 
with a melon balanced on her head. 

“Yeah, baby. William Tell time,” says 
Specialist Browning, a Marine bored out 
of his skull. He and his squad have been on 
duty 100 hours straight, hopped up 
on orexin-z to counter sleep depri- 
vation. 

“Dont even, says Sergeant Knox, 
but Browning’s beam rifle sings, 
melon splatters, and rind flies like 
shrapnel. The woman stands in dis- 
belief, arms spread, while juice — 
thin as artificial blood — drips from 
her abaya. 

“Huzzah, says Browning. The 
woman fixes his position. She ges- 
tures wildly and advances. 

Knox steps from the checkpoint’s 
ceramic-plated kiosk. “Lady, stop 
right there,” he shouts. 

A small crowd gathers. Someone 
tosses a bottle. As it arcs through the 
air, an AI built into an auto-turret atop the 
kiosk assesses the risk. The bottle shatters in 
the street. 

Knox unholsters his pistol and waves it 
at the woman, trying to dissuade her from 
entering the checkpoint’s no-go zone — 
an area mined with cybernetic fleas. He 
hates having to fill out reports when civvies 
need to be deloused. But without pause, 
the woman steps into the orange dust, and 
it comes alive. She stomps, swats, screams 
in frustration. Stepping back, she points at 
Knoxas ifhe is the one to blame and reaches 
into her sleeve. 

Someone flings a tin can at the Marines. 
A magnetometer chatters as if it's a grenade. 
The auto-turret sizzles, spitting streams of 
nano flechettes at hypersonic velocities. The 
can and its contents vaporize; the super- 
heated air rebounds with a thunderclap. 

Knox flinches; his pistol discharges. 

In sudden silence, the woman drops to 
her knees and slumps forward until her 
forehead touches the ground. But this isn’t 
sajdah, a prostration to her God; the back 
of her skull is gone. Tangled hair, flaps of 
skin and remnants of her hajib drape the 
exit wound like a 


I n Abuja, Nigeria, fresh from an open-air 


shroud. In an out- NATURE.COM 
stretched, angled  FollowFutures: 
hand, she clutchesa © @NatureFutures 


melon rind. EF go.nature.com/mtoodm 
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The art of soldiering on. 


The cyberfleas ignore the fruit. They 
gorge on blood. 


Deep within a hospital ship, off the coast of 
Nigeria, Sergeant Knox lies on a polyglass 
table, head encircled by a halo. A psych-tech 
asks for specifics. She wants a description 
of the woman, of the market, of the three- 


legged dog Knox saw earlier that day; she 
wants childhood memories of pets; she 
wants Rorschach associations of the woman's 
blood spatter pattern. 

As she works through Knox's story again 
and again, teasing new details from his narra- 
tive, she walks about an enlarged, holographic 
projection of his brain. Within the stereotac- 
tic model, neural pathways flicker and fluo- 
rescent voxels coalesce: a memory trace. 

“You're on a hippocampus block,’ the tech 
says. “So you wont remember any of this, but 
informed consent requires me to explain that 
when we strip the problematic engram, sec- 
ondary pathways may be affected. Although 
the goal is event-removal, phantom recon- 
struction will occur. The mind heals by filling 
in gaps. But really, you'll be fine. And tomor- 
row, you ll wake up on R&R, unaware of the 
synaptic pruning. No Post Combat Stress 
Disorder for this Marine. Right, Sergeant?” 


In Abuja, Nigeria, fresh from an open-air 
market, a woman dressed in black walks with 
a melon balanced on her head. 

“Oh, baby. I want a taste of that,” says Spe- 
cialist Browning, a Marine bored out of his 
skull. He and his squad have been on duty 
100 hours straight, hopped up on orexin-z. 

“Stand down,’ says Sergeant Knox, but 
Browning lets loose with a wolf-whistle. 

Before the sound dies, a three-legged dog 
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lopes from a shaded doorway. Knox recog- 
nizes it as the stray hed fed while on patrol. 
He laughs. “Yeah, Browning. You sure can 
attract the ladies” 

At Knox’s voice, the dog makes a beeline 
for the kiosk and heads straight into the 
no-go zone. 

“Git!” Knox shouts. 

But the dust awakens. The dog 
yips, bites at its legs, snaps at the air. 
It spins about, activating even more 
of the cybernetic fleas. 

The woman with the melon ges- 
tures at the dog and yells at Knoxina 
language he doesn't understand, but 
her message is all too clear. 

“No,” Knox says. “Not my fault. 
You want him? You get him.” Trying 
to wrangle a feral dog, near rabid 
with pain, ist Knox’s idea of gal- 
lantry. Besides, he’s allergic to the 
fleas’ caustic discharge. 

As if to plead its case, the dog 
howls, its distended voice raw and 
unearthly. 

Jesus, Knox thinks. Why doesn't it 
just run away? 

The bustle of the market has stopped. 
Everyone is watching. Even the kiosk’s auto- 
turret seems transfixed, its barrel jerking in 
micro adjustments as its AI rips through 
algorithms. Assessing. Assessing. Assessing. 

But it won't pull the trigger. Of course not. 
Such dirty work always falls to Knox. Curs- 
ing, he unholsters his pistol and takes aim... 

The dog is dead before it hits the ground. 
It’s disconcerting the way its limbs splay. 
Almost humanlike. 

Knox glances at his pistol. Although smoke 
is curling from its barrel, he doesn't recall 
pulling the trigger. He looks up. His own men, 
the people in the market ... they’re staring at 
him. He catches sight of the woman and for an 
instant it seems as ifher face is alive with bugs. 

The world cants. 

Knox drops to his knees and rocks back 
and forth as waves of guilt wash over him. 

Keep it together, he tells himself. It was 
just a dog. Just a dog. Just a dog. 

He closes his eyes, leans forward and rests 
his forehead on the ground, wishing to God 
none of it were real. m 


William T. Vandemark chases storms, 
photographs weather vanes and prospects for 
fulgurites. His work has appeared in Apex 
Magazine, Intergalactic Medicine Show 
and assorted anthologies. 
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